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EX03-029P 

MPTENS AS MODIFIERS OF THE PTEN/IGF PATHWAY AND METHODS OF 

USE 

BACKGROUND OF THE INVENTION 

Somatic mutations in the PTEN (Phosphatase and Tensin homolog deleted on 
chromosome 10) gene are known to cause tumors in a variety of human tissues. In 
addition, germline mutations in PTEN are the cause of human diseases (Cowden disease 
and Bannayan-Zonana syndrome) associated with increased risk of breast and thyroid 
cancer (Nelen MR et al. (1997) Hum Mol Genet, 8:1383-1387; Liaw D et al. (1997) Nat 
Genet, 1:64-67; Marsh DJ et al. (1998) Hum Mol Genet, 3:507-515). PTEN is thought to 
act as a tumor suppressor by regulating several signaling pathways through the second 
messenger phosphatidylinositol 3,4,5 triphosphate (PIP3). PTEN dephosphorylates the 
D3 position of PIP3 and downregulates signaling events dependent on PIP3 levets 
(Maehama T and Dixon JE (1998) J Biol Chem, 22, 13375-8). In particular, pro-survival 
pathways downstream of the insulin-like growth factor (IGF) pathway are regulated by 
PTEN activity. Stimulation of the IGF pathway, or loss of PTEN function, elevates PIP3 
levels and activates pro-survival pathways associated with tumorigenesis (Stambolic V et 
al. (1998) Cell, 95:29-39). Consistent with this model, elevated levels of insulin-like 
growth factors I and II correlate with increased risk of cancer (Yu H et al (1999) J Natl 
Cancer Inst 91:151-156) and poor prognosis (Takanami I et al, 1996, J Surg Oncol 
61(3):205-8). 

PTEN sequence is conserved in evolution, and exists in mouse (Hansen GM and 
Justice MJ (1998) Mamm Genome, 9(l):88-90), Drosophila (Goberdhan DC et al (1999) 
Genes and Dev, 24:3244-58; Huang H et al (1999) Development 23:5365-72), and C. 
elegans (Ogg S and Ruvkun G, (1998) Mol Cell, (6):887-93). Studies in these model 
organisms have helped to elucidate the role of PTEN in processes relevant to 
tumorigenesis. In Drosophila, the PTEN homolog (dPTEN) has been shown to regulate 
cell size, survival, and proliferation (Huang et al, supra\ Goberdhan et al, supra; Gao X et 
al, 2000, 221:404-418). In mice, loss of PTEN function increases cancer susceptibility 
(Di Cristofano A et al (1998) Nature Genetics, 19:348-355; Suzuki A et al (1998) Curr. 
Biol., 8:1169-78). 
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In addition, a member of the IGF/insulin receptor family exists in Drosophila and 
has been shown to respond to insulin stimulation (Fernandez-Almonacid R, and Rozen 
OM (1987) Mol Cell Bio, (8):27 18-27). Similar to PTEN, studies in Drosophila 
(Brogiolo W et al (2001) Curr Biol, 1 1(4):213-21) and mouse (Moorehead RA et al 
(2003) Oncogene, 22(6):853-857) establish a conserved role for the IGF/insulin pathway 
in growth control. 

The ability to manipulate the genomes of model organisms such as Drosophila 
provides a powerful means to analyze biochemical processes that, due to significant 
evolutionary conservation, have direct relevance to more complex vertebrate 
organisms. Due to a high level of gene and pathway conservation, the strong 
similarity of cellular processes, and the functional conservation of genes between 
these model organisms and mammals, identification of the involvement of novel 
genes in particular pathways and their functions in such model organisms can directly 
contribute to the understanding of the correlative pathways and methods of 
modulating them in mammals (see, for example, Mechler BM et al., 1985 EMBO J 
4:1551-1557; Gateff E. 1982 Adv. Cancer Res. 37: 33-74; Watson KL., et al., 1994 J 
CellSci. 18: 19-33; MiklosGL, and Rubin GM. 1996 Cell 86:521-529; Wassarman 
DA, et al., 1995 Curr Opin Gen Dev 5: 44-50; and Booth DR. 1999 Cancer 
Metastasis Rev. 18: 261-284). For example, a genetic screen can be carried out in an 
invertebrate model organism or cell having underexpression (e.g. knockout) or 
overexpression of a gene (referred to as a "genetic entry point") that yields a visible 
phenotype, such as altered cell growth. Additional genes are mutated in a random or 
targeted manner. When a gene mutation changes the original phenotype caused by 
the mutation in the genetic entry point, the gene is identified as a "modifier" involved 
in the same or overlapping pathway as the genetic entry point. When inactivation of 
either gene is not lethal, but inactivation of both genes results in reduced viability or 
death of the cell, tissue, or organism, the interaction is defined as "synthetic lethal" 
(Bender, A andPringle J, (1991) Mol Cell Biol, 11:1295-1305; Hartman J et al, 
(2001) Science 291:1001-1004; US PAT No:6,489,127). In a synthetic lethal 
interaction, the modifier may also be identified as an "interactor". When the genetic 
entry point is an ortholog of a human gene implicated in a disease pathway, such as 
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the IGF pathway, modifier genes can be identified that may be attractive candidate 
targets for novel therapeutics. 

AH references cited herein, including patents, patent applications, publications, 
and sequence information in referenced Genbank identifier numbers, are incorporated 
herein in their entireties. 

SUMMARY OF THE INVENTION 

We have discovered genes that modify the PTEN/IGF pathway in Drosophila 
cells, and identified their human orthologs, hereinafter referred to as modifier of 
PTEN/IGF (MPTEN). The invention provides methods for utilizing these PTEN/IGF 
modifier genes and polypeptides to identify MPTEN-modulating agents that are 
candidate therapeutic agents that can be used in the treatment of disorders associated 
with defective or impaired PTEN/IGF function and/or MPTEN function. Preferred 
MPTEN-modulating agents specifically bind to MPTEN polypeptides and restore 
PTEN/IGF function. Other preferred MPTEN-modulating agents are nucleic acid 
modulators such as antisense oligomers and RNAi that repress MPTEN gene 
expression or product activity by, for example, binding to and inhibiting the 
respective nucleic acid (i.e. DNA or mRNA). 

MPTEN modulating agents may be evaluated by any convenient in vitro or in 
vivo assay for molecular interaction with an MPTEN polypeptide or nucleic acid. In 
one embodiment, candidate MPTEN modulating agents are tested with an assay 
system comprising a MPTEN polypeptide or nucleic acid. Agents that produce a 
change in the activity of the assay system relative to controls are identified as 
candidate PTEN/IGF modulating agents. The assay system may be cell-based or cell- 
free. MPTEN-modulating agents include MPTEN related proteins (e.g. dominant 
negative mutants, and biotherapeutics); MPTEN -specific antibodies; MPTEN - 
specific antisense oligomers and other nucleic acid modulators; and chemical agents 
that specifically bind to or interact with MPTEN or compete with MPTEN binding 
partner (e.g. by binding to an MPTEN binding partner). In one specific embodiment, 
a small molecule modulator is identified using a binding assay. In specific 
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embodiments, the screening assay system is selected from an apoptosis assay, a cell 
proliferation assay, an angiogenesis assay, and a hypoxic induction assay. 

In another embodiment, candidate PTEN/IGF pathway modulating agents are 
further tested using a second assay system that detects changes in the PTEN/IGF 
pathway, such as angiogenic, apoptotic, or cell proliferation changes produced by the 
originally identified candidate agent or an agent derived from the original agent. The 
second assay system may use cultured cells or non-human animals. In specific 
embodiments, the secondary assay system uses non-human animals, including 
animals predetermined to have a disease or disorder implicating the PTEN/IGF 
pathway, such as an angiogenic, apoptotic, or cell proliferation disorder (e.g. cancer). 

The invention further provides methods for modulating the MPTEN function 
and/or the PTEN/IGF pathway in a mammalian cell by contacting the mammalian cell 
with an agent that specifically binds a MPTEN polypeptide or nucleic acid. The 
agent may be a small molecule modulator, a nucleic acid modulator, or an antibody 
and may be administered to a mammalian animal predetermined to have a pathology 
associated the PTEN/IGF pathway. 

DETAILED DESCRIPTION OF THE INVENTION 

The PTEN co-RNAi plus insulin synthetic lethal screen was designed to identify 
modifier genes that are lethal or reduce proliferation in cells with a hyperstimulated 
IGF/insulin pathway, but not in normal cells. We created cells with a hyperstimulated 
IGF/insulin pathway by treatment with insulin and RNAi-mediated inactivation of 
dPTEN, the Drosophila homologue of the human tumor suppressor PTEN. In addition to 
identifying genes with synthetic lethal interactions in insulin-treated, PTEN-deficient 
cells, this screen identified genes that, when inactivated, preferentially reduced the 
viability of insulin-treated, PTEN-deficient cells relative to normal cells. Modifier genes 
haing a synthetic interaction with the PTEN/IGF pathway were identified. Accordingly, 
vertebrate orthologs of these modifiers, and preferably the human orthologs. MPTEN 
genes (i.e., nucleic acids and polypeptides) are attractive drug targets for the treatment of 
pathologies associated with a defective IGF signaling pathway, such as cancer. Table 1 
lists the modifiers and their orthologs. 
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In vitro and in vivo methods of assessing MPTEN function are provided herein. 
Modulation of the MPTEN or their respective binding partners is useful for 
understanding the association of the PTEN/IGF pathway and its members in normal 
and disease conditions and for developing diagnostics and therapeutic modalities for 
PTEN/IGF related pathologies. MPTEN-modulating agents that act by inhibiting or 
enhancing MPTEN expression, directly or indirectly, for example, by affecting an 
MPTEN function such as enzymatic (e.g., catalytic) or binding activity, can be 
identified using methods provided herein. MPTEN modulating agents are useful in 
diagnosis, therapy and pharmaceutical development. 

Nucleic acids and polypeptides of the invention 

Sequences related to MPTEN nucleic acids and polypeptides that can be used in 
the invention are disclosed in Genbank (referenced by Genbank identifier (GI) or 
RefSeq number), shown in Table 1 and in the appended sequence listing. 

The term "MPTEN polypeptide" refers to a full-length MPTEN protein or a 
functionally active fragment or derivative thereof. A "functionally active" MPTEN 
fragment or derivative exhibits one or more functional activities associated with a 
full-length, wild-type MPTEN protein, such as antigenic or immunogenic activity, 
enzymatic activity, ability to bind natural cellular substrates, etc. The functional 
activity of MPTEN proteins, derivatives and fragments can be assayed by various 
methods known to one skilled in the art (CuiTent Protocols in Protein Science (1998) 
Coligan et al, eds., John Wiley & Sons, Inc., Somerset, New Jersey) and as further 
discussed below. In one embodiment, a functionally active MPTEN polypeptide is a 
MPTEN derivative capable of rescuing defective endogenous MPTEN activity, such 
as in cell based or animal assays; the rescuing derivative may be from the same or a 
different species. For purposes herein, functionally active fragments also include 
those fragments that comprise one or more structural domains of an MPTEN, such as 
a binding domain. Protein domains can be identified using the PFAM program 
(Bateman A., et al.. Nucleic Acids Res, 1999, 27:260-2). Methods for obtaining 
MPTEN polypeptides are also further described below. In some embodiments, 
preferred fragments are functionally active, domain-containing fragments comprising 
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at least 25 contiguous amino acids, preferably at least 50, more preferably 75, and 
most preferably at least 100 contiguous amino acids of an MPTEN. In further 
preferred embodiments, the fragment comprises the entire functionally active domain. 

The term "MPTEN nucleic acid" refers to a DNA or RNA molecule that encodes 
a MPTEN polypeptide. Preferably, the MPTEN polypeptide or nucleic acid or 
fragment thereof is from a human, but can also be an ortholog, or derivative thereof 
with at least 70% sequence identity, preferably at least 80%, more preferably 85%, 
still more preferably 90%, and most preferably at least 95% sequence identity with 
human MPTEN. Methods of identifying orthlogs are known in the art. Normally, 
orthologs in different species retain the same function, due to presence of one or more 
protein motifs and/or 3-dimensional structures. Orthologs arc generally identified by 
sequence homology analysis, such as BLAST analysis, usually using protein bait 
sequences. Sequences are assigned as a potential ortholog if the best hit sequence 
from the forward BLAST result retrieves the original query sequence in the reverse 
BLAST (Huynen MA and Bork P, Proc Natl Acad Sci (1998) 95:5849-5856; Huynen 
MA et aL, Genome Research (2000) 10:1204-1210). Programs for multiple sequence 
alignment, such as CLUSTAL (Thompson JD et al, 1994, Nucleic Acids Res 
22:4673-4680) may be used to highlight conserved regions and/or residues of 
orthologous proteins and to generate phylogenetic trees. In a phylogenetic tree 
representing multiple homologous sequences from diverse species (e.g., retrieved 
through BLAST analysis), orthologous sequences from two species generally appear 
closest on the tree with respect to all other sequences from these two species. 
Structural threading or other analysis of protein folding (e.g., using software by 
ProCeryon, Biosciences, Salzburg, Austria) may also identify potential orthologs. In 
evolution, when a gene duplication event follows speciation, a single gene in one 
species, such as Drosophila, may correspond to multiple genes (paralogs) in another, 
such as human. As used herein, the term "orthologs" encompasses paralogs. As used 
herein, "percent (%) sequence identity" with respect to a subject sequence, or a 
specified portion of a subject sequence, is defined as the percentage of nucleotides or 
amino acids in the candidate derivative sequence identical with the nucleotides or 
amino acids in the subject sequence (or specified portion thereof), after aligning the 
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sequences and introducing gaps, if necessary to achieve the maximum percent 
sequence identity, as generated by the program WU-BLAST-2.0al9 (Altschul el al, 
J. MoL Biol. (1997) 215:403-410) with all the search parameters set to default values. 
The HSP S and HSP S2 parameters are dynamic values and are established by the 
program itself depending upon the composition of the particular sequence and 
composition of the particular database against which the sequence of interest is being 
searched. A % identity value is determined by the number of matching identical 
nucleotides or amino acids divided by the sequence length for which the percent 
identity is being reported. "Percent (%) amino acid sequence similarity" is 
determined by doing the same calculation as for determining % amino acid sequence 
identity, but including conservative amino acid substitutions in addition to identical 
amino acids in the computation. 

A conservative amino acid substitution is one in which an amino acid is 
substituted for another amino acid having similar properties such that the folding or 
activity of the protein is not significantly affected. Aromatic amino acids that can be 
substituted for each other are phenylalanine, tryptophan, and tyrosine; 
interchangeable hydrophobic amino acids are leucine, isoleucine, methionine, and 
valine; interchangeable polar amino acids are glutamine and asparagine; 
interchangeable basic amino acids are arginine, lysine and histidine; interchangeable 
acidic amino acids are aspartic acid and glutamic acid; and interchangeable small 
amino acids are alanine, serine, threonine, cysteine and glycine. 

Alternatively, an alignment for nucleic acid sequences is provided by the local 
homology algorithm of Smith and Waterman (Smith and Waterman, 1981, Advances 
in Applied Mathematics 2:482-489; database: European Bioinformatics Institute; 
Smith and Waterman, 1981, J. of Molec.Biol., 147:195-197; Nicholas et al., 1998, "A 
Tutorial on Searching Sequence Databases and Sequence Scoring Methods" 
(www.psc.edu) and references cited therein.; W.R. Pearson, 1991, Genomics 1 1:635- 
650). This algorithm can be applied to amino acid sequences by using the scoring 
matrix developed by Dayhoff (Dayhoff: Atlas of Protein Sequences and Structure, M. 
O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical Research Foundation, 
Washington, D.C, USA), and normalized by Gribskov (Gribskov 1986 Nucl. Acids 
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Res. 14(6):6745-6763). The Smith-Waterman algorithm may be employed where 
default parameters are used for scoring (for example, gap open penalty of 12, gap 
extension penalty of two). From the data generated, the "Match" value reflects 
"sequence identity." 

Derivative nucleic acid molecules of the subject nucleic acid molecules include 
sequences that hybridize to the nucleic acid sequence of an MPTEN. The stringency 
of hybridization can be controlled by temperature, ionic strength, pH, and the 
presence of denaturing agents such as formamide during hybridization and washing. 
Conditions routinely used are set out iri readily available procedure texts (e.g.. 
Current Protocol in Molecular Biology, Vol. 1, Chap. 2.10, John Wiley & Sons, 
Publishers (1994); Sambrook et at., Molecular Cloning, Cold Spring Harbor (1989)). 
In some embodiments, a nucleic acid molecule of the invention is capable of 
hybridizing to a nucleic acid molecule of an MPTEN under high stringency 
hybridization conditions that are: prehybridization of filters containing nucleic acid 
for 8 hours to overnight at 65° C in a solution comprising 6X single strength citrate 
(SSC) (IX SSC is 0.15 M NaCl, 0.015 M Na citrate; pH 7.0), 5X Denhardts solution, 
0.05% sodium pyrophosphate and 100 lig/m\ herring sperm DNA; hybridization for 
18-20 hours at 65° C in a solution containing 6X SSC, IX Denhardt's solution, 100 
/ig/ml yeast tRNA and 0.05% sodium pyrophosphate; and washing of filters at 65° C 
for lh in a solution containing 0.1X SSC and 0.1% SDS (sodium dodecyl sulfate). 

In other embodiments, moderately stringent hybridization conditions are used that 
are: pretreatment of filters containing nucleic acid for 6 h at 40° C in a solution 
containing 35% formamide, 5X SSC, 50 mMTris-HCl (pH7.5), 5mM EDTA, 0.1% 
PVP, 0.1% Ficoll, 1% BSA, and 500 jxg/ml denatured salmon sperm DNA; 
hybridization for 18-20h at 40° C in a solution containing 35% formamide, 5X SSC, 
50 mM Tris-HCl (pH7.5), 5mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 
jig/ml salmon sperm DNA, and 10% (wt/vol) dextran sulfate; followed by washing 
twice for 1 hour at 55° C in a solution containing 2X SSC and 0.1% SDS. 

Alternatively, low stringency conditions can be used that are: incubation for 8 
hours to overnight at 37° C in a solution comprising 20% formamide, 5 x SSC, 50 
mM sodium phosphate (pH 7.6), 5X Denhardt's solution, 10% dextran sulfate, and 20 
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/zg/ml denatured sheared salmon sperm DNA; hybridization in the same buffer for 18 
to 20 hours; and washing of filters in 1 x SSC at about 37° C for I hour. 

Isolation, Production, Expression, and Mis-expression of MPTEN Nucleic 
Acids and Polypeptides 

MPTEN nucleic acids and polypeptides, are useful for identifying and testing 
agents that modulate MPTEN function and for other applications related to the 
involvement of MPTEN in the PTEN/IGF pathway. MPTEN nucleic acids and 
derivatives and orthologs thereof may be obtained using any available method. For 
instance, techniques for isolating cDNA or genomic DNA sequences of interest by 
screening DNA libraries or by using polymerase chain reaction (PCR) are well known 
in the art. In general, the particular use for the protein will dictate the particulars of 
expression, production, and purification methods. For instance, production of 
proteins for use in screening for modulating agents may require methods that preserve 
specific biological activities of these proteins, whereas production of proteins for 
antibody generation may require structural integrity of particular epitopes. 
Expression of proteins to be purified for screening or antibody production may 
require the addition of specific tags {e.g., generation of fusion proteins). 
Ovcrexpression of an MPTEN protein for assays used to assess MPTEN function, 
such as involvement in cell cycle regulation or hypoxic response, may require 
expression in eukaryotic cell lines capable of these cellular activities. Techniques for 
the expression, production, and purification of proteins are well known in the art; any 
suitable means therefore may be used (e.g., Higgins SJ and Hames BD (eds.) Protein 
Expression: A Practical Approach, Oxford University Press Inc., New York 1999; 
Stanbury PF et al., Principles of Fermentation Technology, 2 nd edition, Elsevier 
Science, New York, 1995; Doonan S (ed.) Protein Purification Protocols, Humana 
Press, New Jersey, 1996; Coligan JE et al, Current Protocols in Protein Science (eds ), 
1999, John Wiley & Sons, New York). In particular embodiments, recombinant 
MPTEN is expressed in a cell line known to have defective PTEN/IGF function. The 
recombinant cells are used in cell-based screening assay systems of the invention, as 
described further below. 
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The nucleotide sequence encoding an MPTEN polypeptide can be inserted into 
any appropriate expression vector. The necessary transcriptional and translational 
signals, including promoter/enhancer element, can derive from the native MPTEN 
gene and/or its flanking regions or can be heterologous. A variety of host-vector 
expression systems may be utilized, such as mammalian cell systems infected with 
virus (e.g. vaccinia virus, adenovirus, etc.); insect cell systems infected with virus 
(e.g. baculovirus); microorganisms such as yeast containing yeast vectors, or bacteria 
transformed with bacteriophage, plasmid, or cosmid DNA. An isolated host cell 
strain that modulates the expression of, modifies, and/or specifically processes the 
gene product may be used. 

To detect expression of the MPTEN gene product, the expression vector can 
comprise a promoter operably linked to an MPTEN gene nucleic acid, one or more 
origins of replication, and, one or more selectable markers (e.g. thymidine kinase 
activity, resistance to antibiotics, etc.). Alternatively, recombinant expression vectors 
can be identified by assaying for the expression of the MPTEN gene product based on 
the physical or functional properties of the MPTEN protein in in vitro assay systems 
(e.g. immunoassays). 

The MPTEN protein, fragment, or derivative may be optionally expressed as a 
fusion, or chimeric protein product (i.e. it is joined via a peptide bond to a 
heterologous protein sequence of a different protein), for example to facilitate 
purification or detection. A chimeric product can be made by ligating the appropriate 
nucleic acid sequences encoding the desired amino acid sequences to each other using 
standard methods and expressing the chimeric product. A chimeric product may also 
be made by protein synthetic techniques, e.g. by use of a peptide synthesizer 
(Hunkapiller etaL. Nature (1984) 310:105-111). 

Once a recombinant cell that expresses the MPTEN gene sequence is identified, 
the gene product can be isolated and purified using standard methods (e.g. ion 
exchange, affinity, and gel exclusion chromatography; centrifugation; differential 
solubility; electrophoresis). Alternatively, native MPTEN proteins can be purified 
from natural sources, by standard methods (e.g. immunoaffinity purification). Once a 
protein is obtained, it may be quantified and its activity measured by appropriate 
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methods, such as immunoassay, bioassay, or other measurements of physical 
properties, such as crystallography. 

The methods of this invention may also use cells that have been engineered for 
altered expression (mis-expression) of MPTEN or other genes associated with the 
FTEN/1GF pathway. As used herein, mis-expression encompasses ectopic 
expression, over-expression, under-cxpression, and non-expression (e.g. by gene 
knock-out or blocking expression that would otherwise normally occur). 

Genetically modified animals 

Animal models that have been genetically modified to alter MPTEN expression 
may be used in in vivo assays to test for activity of a candidate PTEN/IGF modulating 
agent, or to further assess the role of MPTEN in a PTEN/IGF pathway process such 
as apoptosis or cell proliferation. Preferably, the altered MPTEN expression results 
in a detectable phenotype, such as decreased or increased levels of cell proliferation, 
angiogenesis, or apoptosis compared to control animals having normal MPTEN 
expression. The genetically modified animal may additionally have altered 
PTEN/IGF expression (e.g. PTEN/IGF knockout). Preferred genetically modified 
animals are mammals such as primates, rodents (preferably mice or rats), among 
others. Preferred non-mammalian species include zebrafish, C. elegans, and 
Drosophila. Preferred genetically modified animals are transgenic animals having a 
heterologous nucleic acid sequence present as an extrachromosomal element in a 
portion of its cells, i.e. mosaic animals (see, for example, techniques described by 
Jakobovits, 1994, Curr. Biol. 4:761-763.) or stably integrated into its germ line DNA 
(i.e., in the genomic sequence of most or all of its cells). Heterologous nucleic acid is 
introduced into the germ line of such transgenic animals by genetic manipulation of, 
for example, embryos or embryonic stem cells of the host animal. 

Methods of making transgenic animals are well-known in the art (for transgenic 
mice see Brinsteret al., Proc. Nat. Acad. Sci. USA 82: 4438-4442 (1985), U.S. Pat. 
Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 by 
Wagner et al., and Hogan, B., Manipulating the Mouse Embryo, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., (1986); for particle bombardment see 



It 



EX03-029P 

U.S. Pat. No., 4,945,050, by Sandford et al.\ for transgenic Drosophila see Rubin and 
Spradling, Science (1982) 218:348-53 and U.S. Pat. No. 4,670,388; for transgenic 
insects see Berghammer AJ. et al, A Universal Marker for Transgenic Insects (1999) 
Nature 402:370-371; for transgenic Zebrafish see Lin S., Transgenic Zebrafish, 
Methods Mol Biol. (2000);136:375-3830); for microinjection procedures for fish, 
amphibian eggs and birds see Houdebine and Chourrout, Expericntia (1991) 47:897- 
905; for transgenic rats see Hammer et al, Cell (1990) 63: 1099-1 1 12; and for 
culturing of embryonic stem (ES) cells and the subsequent production of transgenic 
animals by the introduction of DNA into ES cells using methods such as 
clectroporation, calcium phosphate/DNA precipitation and direct injection see, e.g., 
Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, E. J. Robertson, 
ed., IRL Press (1987)). Clones of the nonhuman transgenic animals can be produced 
according to available methods (see Wilmut, I. et al (1997) Nature 385:810-813; and 
PCT International Publication Nos. WO 97/07668 and WO 97/07669). 

In one embodiment, the transgenic animal is a "knock-out" animal having a 
heterozygous or homozygous alteration in the sequence of an endogenous MPTEN 
gene that results in a decrease of MPTEN function, preferably such that MPTEN 
expression is undetectable or insignificant. Knock-out animals are typically 
generated by homologous recombination with a vector comprising a transgene having 
at least a portion of the gene to be knocked out. Typically a deletion, addition or 
substitution has been introduced into the transgene to functionally disrupt it. The 
transgene can be a human gene (e.g., from a human genomic clone) but more 
preferably is an ortholog of the human gene derived from the transgenic host species. 
For example, a mouse MPTEN gene is used to construct a homologous recombination 
vector suitable for altering an endogenous MPTEN gene in the mouse genome. 
Detailed methodologies for homologous recombination in mice are available (see 
Capecchi, Science (1989) 244:1288-1292; Joyner et al, Nature (1989) 338: 153-156). 
Procedures for the production of non-rodent transgenic mammals and other animals 
are also available (Houdebine and Chourrout, supra; Pursel et al, Science (1989) 
244: 1281-1288; Simms et al, Bio/Technology (1988) 6:179-183). In a preferred 
embodiment, knock-out animals, such as mice harboring a knockout of a specific 
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gene, may be used to produce antibodies against the human counterpart of the gene 
that has been knocked out (Claesson MH et al., (1994) Scan J Immunol 40:257-264; 
Declerck PJ et al., (1995) J Biol Chem. 270:8397-400). 

In another embodiment, the transgenic animal is a "knock-in" animal having an 
alteration in its genome that results in altered expression (e.g., increased (including 
ectopic) or decreased expression) of the MPTEN gene, e.g., by introduction of additional 
copies of MPTEN, or by operatively inserting a regulatory sequence that provides for 
altered expression of an endogenous copy of the MPTEN gene. Such regulatory 
sequences include inducible, tissue-specific, and constitutive promoters and enhancer 
elements. The knock-in can be homozygous or heterozygous. 

Transgenic nonhuman animals can also be produced that contain selected systems 
allowing for regulated expression of the transgene. One example of such a system that 
may be produced is the cre/loxP recombinase system of bacteriophage PI (Lakso et a/., 
PNAS (1992) 89:6232-6236; U.S. Pat. No. 4,959,317). If a cre/loxP recombinase system 
is used to regulate expression of the transgene, animals containing transgenes encoding 
both the Cre recombinase and a selected protein are required. Such animals can be 
provided through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. Another example of a recombinase 
system is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; U.S. Pat. No. 5,654,182). In a preferred embodiment, 
both Cre-LoxP and Flp-Frt are used in the same system to regulate expression of the 
transgene, and for sequential deletion of vector sequences in the same cell (Sun X et al 
(2000) Nat Genet 25:83-6). 

The genetically modified animals can be used in genetic studies to further 
elucidate the PTEN/IGF pathway, as animal models of disease and disorders implicating 
defective PTEN/IGF function, and for in vivo testing of candidate therapeutic agents, 
such as those identified in screens described below. The candidate therapeutic agents are 
administered to a genetically modified animal having altered MPTEN function and 
phenotypic changes are compared with appropriate control animals such as genetically 
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modified animals that receive placebo treatment, and/or animals with unaltered MPTEN 
expression that receive candidate therapeutic agent. 

In addition to the above-described genetically modified animals having altered 
MPTEN function, animal models having defective PTEN/IGF function (and otherwise 
normal MPTEN function), can be used in the methods of the present invention. For 
example, a mouse with defective PTEN function can be used to assess, in vivo, the 
activity of a candidate PTEN modulating agent identified in one of the in vitro assays 
described below. Transgenic mice with defective PTEN function have been described in 
literature (Di Cristofano et al, supra). Preferably, the candidate PTEN/IGF modulating 
agent when administered to a model system with cells defective in PTEN/IGF function, 
produces a detectable phenotypic change in the model system indicating that the 
PTEN/IGF function is restored, i.e., the cells exhibit normal cell cycle progression. 

Modulating Agents 

The invention provides methods to identify agents that interact with and/or modulate 
the function of MPTEN and/or the PTEN/IGF pathway. Modulating agents identified by 
the methods are also part of the invention. Such agents are useful in a variety of 
diagnostic and therapeutic applications associated with the PTEN/IGF pathway, as well 
as in further analysis of the MPTEN protein and its contribution to the PTEN/IGF 
pathway. Accordingly, the invention also provides methods for modulating the 
PTEN/IGF pathway comprising the step of specifically modulating MPTEN activity by 
administering a MPTEN-interacting or -modulating agent. 

As used herein, an "MPTEN-modulating agent" is any agent that modulates MPTEN 
function, for example, an agent that interacts with MPTEN to inhibit or enhance MPTEN 
activity or otherwise affect normal MPTEN function. MPTEN function can be affected 
at any level, including transcription, protein expression, protein localization, and cellular 
or extra-cellular activity. In a preferred embodiment, the MPTEN - modulating agent 
specifically modulates the function of the MPTEN. The phrases "specific modulating 
agent", "specifically modulates", etc., are used herein to refer to modulating agents that 
directly bind to the MPTEN polypeptide or nucleic acid, and preferably inhibit, enhance, 
or otherwise alter, the function of the MPTEN. These phrases also encompass 
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modulating agents that alter the interaction of the MPTEN with a binding partner, 
substrate, or cofactor (e.g. by binding to a binding partner of an MPTEN, or to a 
protein/binding partner complex, and altering MPTEN function). In a further preferred 
embodiment, the MPTEN- modulating agent is a modulator of the PTEN/1GF pathway 
(e.g. it restores and/or upregulates PTEN/IGF function) and thus is also a PTEN/IGF- 
modulating agent. 

Preferred MPTEN-modulating agents include small molecule compounds; MPTEN- 
interacting proteins, including antibodies and other biotherapeutics; and nucleic acid 
modulators such as antisense and RNA inhibitors. The modulating agents may be 
formulated in pharmaceutical compositions, for example, as compositions that may 
comprise other active ingredients, as in combination therapy, and/or suitable carriers or 
excipients. Techniques for formulation and administration of the compounds may be 
found in "Remington's Pharmaceutical Sciences" Mack Publishing Co., Easton, PA, 19 th 
edition. 

Small molecule modulators 

Small molecules are often preferred to modulate function of proteins with enzymatic 
function, and/or containing protein interaction domains. Chemical agents, referred to in 
the art as "small molecule" compounds are typically organic, non-peptide molecules, 
having a molecular weight less than 10,000, preferably less than 5,000, more preferably 
less than 1,000, and most preferably less than 500 daltons. This class of modulators 
includes chemically synthesized molecules, for instance, compounds from combinatorial 
chemical libraries. Synthetic compounds may be rationally designed or identified based 
on known or inferred properties of the MPTEN protein or may be identified by screening 
compound libraries. Alternative appropriate modulators of this class are natural products, 
particularly secondary metabolites from organisms such as plants or fungi, which can 
also be identified by screening compound libraries for MPTEN-modulating activity. 
Methods for generating and obtaining compounds are well known in the art (Schreiber 
SL, Science (2000) 151: 1964-1969; Radmann J and Gunther J, Science (2000) 
151:1947-1948). 
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Small molecule modulators identified from screening assays, as described below, can 
be used as lead compounds from which candidate clinical compounds may be designed, 
optimized, and synthesized. Such clinical compounds may have utility in treating 
pathologies associated with the PTEN/IGF pathway. The activity of candidate small 
molecule modulating agents may be improved several-fold through iterative secondary 
functional validation, as further described below, structure determination, and candidate 
modulator modification and testing. Additionally, candidate clinical compounds are 
generated with specific regard to clinical and pharmacological properties. For example, 
the reagents may be derivatized and re-screened using in vitro and in vivo assays to 
optimize activity and minimize toxicity for pharmaceutical development. 

Protein Modulators 

Specific MPTEN-interacting proteins are useful in a variety of diagnostic and 
therapeutic applications related to the PTEN/IGF pathway and related disorders, as well 
as in validation assays for other MPTEN-modulating agents. In a preferred embodiment, 
MPTEN-interacting proteins affect normal MPTEN function, including transcription, 
protein expression, protein localization, and cellular or extra-cellular activity. In another 
embodiment, MPTEN-interacting proteins are useful in detecting and providing 
information about the function of MPTEN proteins, as is relevant to PTEN/IGF related 
disorders, such as cancer (e.g., for diagnostic means). 

An MPTEN-interacting protein may be endogenous, i.e. one that naturally interacts 
genetically or biochemically with an MPTEN, such as a member of the MPTEN pathway 
that modulates MPTEN expression, localization, and/or activity. MPTEN-modulators 
include dominant negative forms of MPTEN-interacting proteins and of MPTEN proteins 
themselves. Yeast two-hybrid and variant screens offer preferred methods for identifying 
endogenous MPTEN-interacting proteins (Finley, R. L. et al. (1996) in DNA Cloning- 
Expression Systems: A Practical Approach, eds. Glover D. & Hames B. D (Oxford 
University Press, Oxford, England), pp. 169-203; Fashema SF et al., Gene (2000) 250: 1- 
14; Drees BL Curr Opin Chem Biol (1999) 3:64-70; Vidal M and Legrain P Nucleic 
Acids Res (1999) 27:919-29; and U.S. Pat. No. 5,928,868). Mass spectrometry is an 
alternative preferred method for the elucidation of protein complexes (reviewed in, e.g., 



16 



EX03-029P 

Pandley A and Mann M, Nature (2000) 405:837-846; Yates JR 3 rd , Trends Genet (2000) 
16:5-8). 

An MPTEN-interacting protein may be an exogenous protein, such as an MPTEN- 
specific antibody or a T-cell antigen receptor (see, e.g., Harlow and Lane (1988) 
Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory; Harlow and Lane 
(1999) Using antibodies: a laboratory manual. Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press). MPTEN antibodies are further discussed below. 

In preferred embodiments, an MPTEN-interacting protein specifically binds an 
MPTEN protein. In alternative preferred embodiments, an MPTEN-modulating agent 
binds an MPTEN substrate, binding partner, or cofactor. 

Antibodies 

In another embodiment, the protein modulator is an MPTEN specific antibody agonist 
or antagonist. The antibodies have therapeutic and diagnostic utilities, and can be used in 
screening assays to identify MPTEN modulators. The antibodies can also be used in 
dissecting the portions of the MPTEN pathway responsible for various cellular responses 
and in the general processing and maturation of the MPTEN. 

Antibodies that specifically bind MPTEN polypeptides can be generated using known 
methods. Preferably the antibody is specific to a mammalian ortholog of MPTEN 
polypeptide, and more preferably, to human MPTEN. Antibodies may be polyclonal, 
monoclonal (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab , ).sub.2 fragments, fragments produced by a FAb expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
Epitopes of MPTEN which are particularly antigenic can be selected, for example, by 
routine screening of MPTEN polypeptides for antigenicity or by applying a theoretical 
method for selecting antigenic regions of a protein (Hopp and Wood (1981), Proc. Nati. 
Acad. Sci. US.A. 78:3824-28; Hopp and Wood, (1983) Mol. Immunol. 20:483-89; 
Sutcliffe et al., (1983) Science 219:660-66) to the amino acid sequence of an MPTEN. 
Monoclonal antibodies with affinities of 10 8 M" 1 preferably 10 9 M~' to 10 10 M"\ or 
stronger can be made by standard procedures as described (Harlow and Lane, supra\ 
Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed) Academic Press, 
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New York; and U.S. Pat. Nos. 4,381,292; 4,451,570; and 4,618,577). Antibodies may be 
generated against crude cell extracts of MPTEN or substantially purified fragments 
thereof. If MPTEN fragments are used, they preferably comprise at least 10, and more 
preferably, at least 20 contiguous amino acids of an MPTEN protein. In a particular 
embodiment, MPTEN-specific antigens and/or immunogens are coupled to carrier 
proteins that stimulate the immune response. For example, the subject polypeptides are 
covalently coupled to the keyhole limpet hemocyanin (KLH) carrier, and the conjugate is 
emulsified in Freund's complete adjuvant, which enhances the immune response. An 
appropriate immune system such as a laboratory rabbit or mouse is immunized according 
to conventional protocols. 

The presence of MPTEN-specific antibodies is assayed by an appropriate assay such 
as a solid phase enzyme-linked immunosorbant assay (ELISA) using immobilized 
corresponding MPTEN polypeptides. Other assays, such as radioimmunoassays or 
fluorescent assays might also be used. 

Chimeric antibodies specific to MPTEN polypeptides can be made that contain 
different portions from different animal species. For instance, a human immunoglobulin 
constant region may be linked to a variable region of a murine mAb, such that the 
antibody derives its biological activity from the human antibody, and its binding 
specificity from the murine fragment. Chimeric antibodies are produced by splicing 
together genes that encode the appropriate regions from each species (Morrison et al., 
Proc. Natl. Acad. Sci. (1984) 81:6851-6855; Neuberger et ah, Nature (1984) 312:604- 
608; Takeda et al., Nature (1985) 31:452-454). Humanized antibodies, which are a form 
of chimeric antibodies, can be generated by grafting complementary-determining regions 
(CDRs) (Carlos, T. M., J. M. Harlan. 1994. Blood 84:2068-2101) of mouse antibodies 
into a background of human framework regions and constant regions by recombinant 
DNA technology (Riechmann LM, et al., 1988 Nature 323: 323-327). Humanized 
antibodies contain -10% murine sequences and -90% human sequences, and thus further 
reduce or eliminate immunogenicity, while retaining the antibody specificities (Co MS, 
and Queen C. 1991 Nature 351: 501-501; Morrison SL. 1992 Ann. Rev. Immun. 
10:239-265). Humanized antibodies and methods of their production are well-known in 
the art (U.S. Pat. Nos. 5,530,101, 5,585,089, 5,693,762, and 6,180,370). 
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MPTEN-specific single chain antibodies which are recombinant, single chain 
polypeptides formed by linking the heavy and light chain fragments of the Fv regions via 
an amino acid bridge, can be produced by methods known in the art (U.S. Pat. No. 
4,946,778; Bird, Science (1988) 242:423-426; Huston et al., Proc. Natl. Acad. Sci. USA 
(1988) 85:5879-5883; and Ward et al., Nature (1989) 334:544-546). 

Other suitable techniques for antibody production involve in vitro exposure of 
lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of 
antibodies in phage or similar vectors (Huse et al., Science (1989) 246:1275-1281). As 
used herein, T-cell antigen receptors are included within the scope of antibody 
modulators (Harlow and Lane, 1988, supra). 

The polypeptides and antibodies of the present invention may be used with or without 
modification. Frequently, antibodies will be labeled by joining, either covalently or non- 
covalently, a substance that provides for a detectable signal, or that is toxic to cells that 
express the targeted protein (Menard S, et al., Int J. Biol Markers (1989) 4: 131-134). A 
wide variety of labels and conjugation techniques are known and are reported extensively 
in both the scientific and patent literature. Suitable labels include radionuclides, 
enzymes, substrates, cofactors, inhibitors, fluorescent moieties, fluorescent emitting 
lanthanide metals, chemiluminescent moieties, bioluminescent moieties, magnetic 
particles, and the like (U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
4,277,437; 4,275,149; and 4,366,241). Also, recombinant immunoglobulins may be 
produced (U.S. Pat. No. 4,816,567). Antibodies to cytoplasmic polypeptides may be 
delivered and reach their targets by conjugation with membrane-penetrating toxin 
proteins (U.S. Pat. No. 6,086,900). 

When used therapeutically in a patient, the antibodies of the subject invention are 
typically administered parenterally, when possible at the target site, or intravenously. The 
therapeutically effective dose and dosage regimen is determined by clinical studies. 
Typically, the amount of antibody administered is in the range of about 0.1 mg/kg -to 
about 10 mg/kg of patient weight. For parenteral administration, the antibodies are 
formulated in a unit dosage injectable form (e.g., solution, suspension, emulsion) in 
association with a pharmaceutically acceptable vehicle. Such vehicles are inherently 
nontoxic and non-therapeutic. Examples are water, saline, Ringer's solution, dextrose 
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solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils, ethyl 
oleate, or liposome carriers may also be used. The vehicle may contain minor amounts of 
additives, such as buffers and preservatives, which enhance isotonicity and chemical 
stability or otherwise enhance therapeutic potential. The antibodies' concentrations in 
such vehicles are typically in the range of about 1 mg/ml to aboutlO mg/ml. 
Immunotherapeutic methods are further described in the literature (US Pat. No. 
5,859,206; WO0073469). 

Specific biotherapeutics 

Tn a preferred embodiment, an MPTEN-interacting protein may have biotherapeutic 
applications. Biotherapeutic agents formulated in pharmaceutically acceptable carriers 
and dosages may be used to activate or inhibit signal transduction pathways. This 
modulation may be accomplished by binding a ligand, thus inhibiting the activity of the 
pathway; or by binding a receptor, either to inhibit activation of, or to activate, the 
receptor. Alternatively, the biotherapeutic may itself be a ligand capable of activating or 
inhibiting a receptor. Biotherapeutic agents and methods of producing them are 
described in detail in U.S. Pat. No. 6,146,628. 

When the MPTEN is a ligand, it may be used as a biotherapeutic agent to activate or 
inhibit its natural receptor. Alternatively, antibodies against MPTEN, as described in the 
previous section, may be used as biotherapeutic agents. 

When the MPTEN is a receptor, its ligand(s), antibodies to the ligand(s) or the 
MPTEN itself may be used as biotherapeutics to modulate the activity of MPTEN in the 
PTEN/IGF pathway. 

Nucleic Acid Modulators 

Other preferred MPTEN-modulating agents comprise nucleic acid molecules, such as 
antisense oligomers or double stranded RNA (dsRNA), which generally inhibit MPTEN 
activity. Preferred nucleic acid modulators interfere with the function of the MPTEN 
nucleic acid such as DNA replication, transcription, translocation of the MPTEN RNA to 
the site of protein translation, translation of protein from the MPTEN RNA, splicing of 
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the MPTEN RNA to yield one or more mRNA species, or catalytic activity which may be 
engaged in or facilitated by the MPTEN RNA. 

In one embodiment, the antisense oligomer is an oligonucleotide that is sufficiently 
complementary to an MPTEN mRNA to bind to and prevent translation, preferably by 
binding to the 5' untranslated region. MPTEN-specific antisense oligonucleotides, 
preferably range from at least 6 to about 200 nucleotides. In some embodiments the 
oligonucleotide is preferably at least 10, 15, or 20 nucleotides in length. In other 
embodiments, the oligonucleotide is preferably less than 50, 40, or 30 nucleotides in 
length. The oligonucleotide can be DNA or RNA or a chimeric mixture or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can 
be modified at the base moiety, sugar moiety, or phosphate backbone. The 
oligonucleotide may include other appending groups such as peptides, agents that 
facilitate transport across the cell membrane, hybridization- triggered cleavage agents, and 
intercalating agents. 

In another embodiment, the antisense oligomer is a phosphothioate morpholino 
oligomer (PMO). PMOs are assembled from four different morpholino subuntts, each of 
which contain one of four genetic bases (A, C, G, or T) linked to a six-membered 
morpholine ring. Polymers of these subunits are joined by non-ionic phosphodiamidate 
intersubunit linkages. Details of how to make and use PMOs and other antisense 
oligomers are well known in the art (e.g. see W099/18193; Probst JC, Antisense 
Oligodeoxynucleotide and Ribozyme Design, Methods. (2000) 22(3):271-281; 
Summerton J, and Weller D. 1997 Antisense Nucleic Acid Drug Dev. :7: 187-95; US Pat, 
No. 5,235,033; and US Pat No. 5,378,841). 

Alternative preferred MPTEN nucleic acid modulators are double-stranded RNA 
species mediating RNA interference (RNAi). RNAi is the process of sequence-specific, 
post-transcriptional gene silencing in animals and plants, initiated by double-stranded 
RNA (dsRNA) that is homologous in sequence to the silenced gene. Methods relating to 
the use of RNAi to silence genes in C elegans, Drosophila, plants, and humans are 
known in the art (Fire A, et al. v 1998 Nature 391:806-811; Fire, A. Trends Genet. 15, 
358-363 (1999); Sharp, P. A. RNA interference 2001. Genes Dev. 15, 485-490 (2001); 
Hammond, S. M„ et al., Nature Rev. Genet. 2, 110-11 19 (2001); Tuschl, T. Chem. 
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Biochem. 2, 239-245 (2001); Hamilton, A. et al., Science 286, 950-952 (1999); 
Hammond, S. M., et al., Nature 404, 293-296 (2000); Zamorc, P. D., et al., Cell 101, 25- 
33 (2000); Bernstein, E., et al., Nature 409, 363-366 (2001); Elbashir, S. M., et al., 
Genes Dev. 15, 188-200 (2001); WO0129058; W09932619; Elbashir SM, et al., 2001 
Nature 411:494-498). 

Nucleic acid modulators are commonly used as research reagents, diagnostics, and 
therapeutics. For example, antisense oligonucleotides, which are able to inhibit gene 
expression with exquisite specificity, are often used to elucidate the function of particular 
genes (see, for example, U.S. Pat. No. 6,165,790). Nucleic acid modulators are also used, 
for example, to distinguish between functions of various members of a biological 
pathway. For example, antisense oligomers have been employed as therapeutic moieties 
in the treatment of disease states in animals and man and have been demonstrated in 
numerous clinical trials to be safe and effective (Milligan JF, et al, Current Concepts in 
Antisense Drug Design, J Med Chem. (1993) 36: 1923-1937; Tonkinson JL et al y 
Antisense Oligodeoxynucleotides as Clinical Therapeutic Agents, Cancer Invest. (1996) 
14:54-65). Accordingly, in one aspect of the invention, an MPTEN-specific nucleic acid 
modulator is used in an assay to further elucidate the role of the MPTEN in the 
PTEN/IGF pathway, and/or its relationship to other members of the pathway. In another 
aspect of the invention, an MPTEN-specific antisense oligomer is used as a therapeutic 
agent for treatment of PTEN/IGF-related disease states. 

Assay Systems 

The invention provides assay systems and screening methods for identifying specific 
modulators of MPTEN activity. As used herein, an "assay system" encompasses all the 
components required for performing and analyzing results of an assay that delects and/or 
measures a particular event. In general, primary assays are used to identify or confirm a 
modulator's specific biochemical or molecular effect with respect to the MPTEN nucleic 
acid or protein. In general, secondary assays further assess the activity of a MPTEN 
modulating agent identified by a primary assay and may confirm that the modulating 
agent affects MPTEN in a manner relevant to the PTEN/IGF pathway. In some cases, 
MPTEN modulators will be directly tested in a secondary assay. 
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In a preferred embodiment, the screening method comprises contacting a suitable 
assay system comprising an MPTEN polypeptide or nucleic acid with a candidate agent 
under conditions whereby, but for the presence of the agent, the system provides a 
reference activity (e.g. binding activity), which is based on the particular molecular event 
the screening method detects. A statistically significant difference between the agent- 
biased activity and the reference activity indicates that the candidate agent modulates 
MPTEN activity, and hence the PTEN/IGF pathway. The MPTEN polypeptide or 
nucleic acid used in the assay may comprise any of the nucleic acids or polypeptides 
described above. 

Primary Assays 

The type of modulator tested generally determines the type of primary assay. 

Primary assays for small molecule modulators 
For small molecule modulators, screening assays are used to identify candidate 
modulators. Screening assays may be cell-based or may use a cell-free system that 
recreates or retains the relevant biochemical reaction of the target protein (reviewed in 
Sittampalam GS ex al., Curr Opin Chem Biol (1997) 1:384-91 and accompanying 
references). As used herein the term "cell-based" refers to assays using live cells, dead 
cells, or a particular cellular fraction, such as a membrane, endoplasmic reticulum, or 
mitochondrial fraction. The term "cell free" encompasses assays using substantially 
purified protein (either endogenous or recombinantly produced), partially purified or 
crude cellular extracts. Screening assays may detect a variety of molecular events, 
including protein-DNA interactions, protein-protein interactions (e.g. , receptor-ligand 
binding), transcriptional activity (e.g., using a reporter gene), enzymatic activity (e.g., via 
a property of the substrate), activity of second messengers, immunogenicty and changes 
in cellular morphology or other cellular characteristics. Appropriate screening assays 
may use a wide range of detection methods including fluorescent, radioactive, 
colorimetric, spectrophotometric, and amperometric methods, to provide a read-out for 
the particular molecular event detected. 
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Cell-based screening assays usually require systems for recombinant expression of 
MPTEN and any auxiliary proteins demanded by the particular assay. Appropriate 
methods for generating recombinant proteins produce sufficient quantities of proteins that 
retain their relevant biological activities and are of sufficient purity to optimize activity 
and assure assay reproducibility. Yeast two-hybrid and variant screens, and mass 
spectrometry provide preferred methods for determining protein-protein interactions and 
elucidation of protein complexes. In certain applications, when MPTEN-interacting 
proteins are used in screens to identify small molecule modulators, the binding specificity 
of the interacting protein to the MPTEN protein may be assayed by various known 
methods such as substrate processing (e.g. ability of the candidate MFTEN-specific 
binding agents to function as negative effectors in MPTEN-expressing cells), binding 
equilibrium constants (usually at least about 10 7 M"\ preferably at least about 10 8 M"\ 
more preferably at least about 10 9 M" 1 ), and immunogenicity (e.g. ability to elicit 
MPTEN specific antibody in a heterologous host such as a mouse, rat, goat or rabbit). 
For enzymes and receptors, binding may be assayed by, respectively, substrate and ligand 
processing. 

The screening assay may measure a candidate agent's ability to specifically bind to or 
modulate activity of a MPTEN polypeptide, a fusion protein thereof, or to cells or 
membranes bearing the polypeptide or fusion protein. The MPTEN polypeptide can be 
full length or a fragment thereof that retains functional MPTEN activity. The MPTEN 
polypeptide may be fused to another polypeptide, such as a peptide tag for detection or 
anchoring, or to another tag. The MPTEN polypeptide is preferably human MPTEN, or 
is an ortholog or derivative thereof as described above. In a preferred embodiment, the 
screening assay detects candidate agent-based modulation of MPTEN interaction with a 
binding target, such as an endogenous or exogenous protein or other substrate that has 
MPTEN -specific binding activity, and can be used to assess normal MPTEN gene 
function. 

Suitable assay formats that may be adapted to screen for MPTEN modulators are 
known in the art. Preferred screening assays are high throughput or ultra high throughput 
and thus provide automated, cost-effective means of screening compound libraries for 
lead compounds (Femandes PB, Curr Opin Chem Biol (1998) 2:597-603; Sundberg SA, 
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Curr Opin Biotechnol 2000, 1 1:47-53). In one preferred embodiment, screening assays 
uses fluorescence technologies, including fluorescence polarization, time-resolved 
fluorescence, and fluorescence resonance energy transfer. These systems offer means to 
monitor protein-protein or DNA-protein interactions in which the intensity of the signal 
emitted from dye-labeled molecules depends upon their interactions with partner 
molecules (e.g., Selvin PR, Nat Struct Biol (2000) 7:730-4; Fernandes PB, supra; 
Hertzberg RP and Pope AJ, Curr Opin Chem Biol (2000) 4:445-451). 

A variety of suitable assay systems may be used to identify candidate MPTEN and 
PTEN/IGF pathway modulators (U.S. Pat. Nos. 5,550,019 and 6,133,437 (apoptosis 
assays); U.S. Pat. No. 6,114,132 (phosphatase and protease assays), U.S. Pat. Nos. 
5,976,782, 6,225,118 and 6,444,434 (angiogenesis assays), among others). Specific 
preferred assays are described in more detail below. 

Proteases are enzymes that cleave protein substrates at specific sites. Exemplary 
assays detect the alterations in the spectral properties of an artificial substrate that occur 
upon protease-mediated cleavage. In one example, synthetic caspase substrates 
containing four amino acid proteolysis recognition sequences, separating two different 
fluorescent tags are employed; fluorescence resonance energy transfer detects the 
proximity of these fluorophores, which indicates whether the substrate is cleaved 
(Mahajan NP et al., Chem Biol (1999) 6:401-409). 

RNA folds into a myriad of tertiary structures that are responsible for its diverse 
functions in cells. In most instances, RNA is associated with RNA-binding proteins 
(RBPs) that protect, stabilize, package or transport RNA, mediate RNA interactions with 
other biomolecules or act catalytically on RNA. The structural information obtained for 
RNA alone and RNA-protein complexes has elucidated a variety of RNA tertiary 
structures and diverse modes for RNA-protein interaction. The specific interaction of 
proteins with highly structured RNAs makes it possible to target unique RNA motifs with 
small molecules, thus making RNA an interesting target for therapeutic intervention. 
Assays for RNA binding or processing may be based on homogeneous scintillation 
proximity (Liu J, et al., Anal Biochem 2001 289:239-245), chemiluminescense 
(Mazumder A, Nucleic Acids Res 1998 26:1996-2000), gel shift (Stull RA, et al., 
Antisense Nucleic Acid Drug Dev 1996 6:221-228; U.S. Pat. No: 6004749). 
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Transcription factors control gene transcription. Electrophoretic mobility shift 
assay (EMS A) or gel shift assay is one of the most powerful methods for studying 
protein-DNA interactions. High throughput gel shift assays for transcription factors may 
involve fluorescence (Cyano dye Cy5) labeled oligodeoxynucleotide duplexes as specific 
probes and an automatic DNA sequencer for analysis (Ruscher K, et al., (2000) J 
Biotechnol 78:163-70). Alternatively high throughput methods involve colorimetric 
assays (Renard P, et al. (2001) Nucleic Acids Res 29(4):E21), or homogeneous 
fluorescence assays for the detection and quantification of sequence-specific DNA- 
binding proteins (Heyduk T, and Heyduk E (2001) Nat Biotechnol 20:171-6.) 

Reductases are enzymes of oxidoreductase class that catalyze reactions in which 
metabolites are reduced. High throughput screening assays for reductases may involve 
scintillation (Femandes PB. (1998) Curr Opin Chem Biol 2:597-603; Delaporte E et al. 
(2001) J Biomol Screen 6:225-231). 

Assays for ATPase activity are known in the art, such as described in Blackburn 
et al (Blackburn CL, et al„ (1999) J Org Chem 64:5565-5570). The ATPase assay is 
performed using the EnzCheck ATPase kit (Molecular Probes). The assays are performed 
using an Ultraspec spectrophotometer (Pharmacia), and the progress of the reaction are 
monitored by absorbance increase at 360 nm. Microtubules (1.7 mM final), kinesin ( 
0.1 1 mM final), inhibitor (or DMSO blank at 5% final), and the EnzCheck components 
(purine nucleotide phosphorylase and MESG substrate) are premixed in the cuvette in a 
reaction buffer (40 mM PIPES pH 6.8, 5 mM paclitaxel, 1 mM EGTA, 5 mM MgC12). 
The reaction is initiated by addition of MgATP (1 mM final). 

High- throughput assays, such as scintillation proximity assays, for synthase 
enzymes involved in fatty acid synthesis are known in the art (He X et al (2000) Anal 
Biochem 2000 Jun 15;282(1): 107-14). 

Apoptosis assays. Assays for apoptosis may be performed by terminal 
deoxynucleotidyl transferase-mediated digoxigenin-1 1-dUTP nick end labeling (TUNEL) 
assay. The TUNEL assay is used to measure nuclear DNA fragmentation characteristic 
of apoptosis ( Lazebnik et ai, 1994, Nature 371, 346), by following the incorporation of 
fluorescein-dUTP (Yonehara et al., 1989, J. Exp. Med. 169, 1747). Apoptosis may 
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further be assayed by acridine orange staining of tissue culture cells (Lucas, R., et al., 
1998, Blood 15:4730-41). An apoptosis assay system may comprise a cell that expresses 
an MPTEN, and that optionally has defective PTEN/1GF function (e.g. PTEN/IGF is 
over-expressed or under-expressed relative to wild-type cells). A test agent can be added 
to the apoptosis assay system and changes in induction of apoptosis relative to controls 
where no test agent is added, identify candidate PTEN/IGF modulating agents. In some 
embodiments of the invention, an apoptosis assay may be used as a secondary assay to 
test a candidate PTEN/IGF modulating agents that is initially identified using a cell-free 
assay system. An apoptosis assay may also be used to test whether MPTEN function 
plays a direct role in apoptosis. For example, an apoptosis assay may be performed on 
cells that over- or under-express MPTEN relative to wild type cells. Differences in 
apoptotic response compared to wild type cells suggests that the MPTEN plays a direct 
role in the apoptotic response. Apoptosis assays are described further in US Pat. No. 
6,133,437. 

Cell proliferation and cell cycle assays. Cell proliferation may be assayed via 
bromodeoxyuridine (BRDU) incorporation. This assay identifies a cell population 
undergoing DNA synthesis by incorporation of BRDU into newly-synthesized DNA. 
Newly-synthesized DNA may then be detected using an anti-BRDU antibody (Hoshino et 
a/., 1986, Int. J. Cancer 38, 369; Campana et al„ 1988, J. Immunol. Meth. 107, 79), or by 
other means. 

Cell proliferation is also assayed via phospho-histone H3 staining, which identifies a 
cell population undergoing mitosis by phosphorylation of histone H3. Phosphorylation of 
histone H3 at serine 10 is detected using an antibody specfic to the phosphorylated form 
of the serine 10 residue of histone H3. (Chadlee,D.N. 1995, J. Biol. Chem 270:20098- 
105). Cell Proliferation may also be examined using [ 3 H]-thymidine incorporation 
(Chen, J., 1996, Oncogene 13:1395-403; Jeoung, J., 1995, J. Biol. Chem. 270:18367-73). 
This assay allows for quantitative characterization of S-phase DNA syntheses. In this 
assay, cells synthesizing DNA will incorporate [ 3 H]-thymidine into newly synthesized 
DNA. Incorporation can then be measured by standard techniques such as by counting of 
radioisotope in a scintillation counter (e.g., Beckman LS 3800 Liquid Scintillation 
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Counter). Another proliferation assay uses the dye Alamar Blue (available from 
Biosource International), which fluoresces when reduced in living cells and provides an 
indirect measurement of cell number (Voytik-Harbin SL et al., 1998, In Vitro Cell Dev 
Biol Anim 34:239-46). 

Cell proliferation may also be assayed by colony formation in soft agar (Sambrook ct 
al., Molecular Cloning, Cold Spring Harbor (1989)). For example, cells transformed with 
MPTEN are seeded in soft agar plates, and colonies are measured and counted after two 
weeks incubation. 

Involvement of a gene in the cell cycle may be assayed by flow cytometry (Gray JW 
et al. (1986) Int J Radiat Biol Relat Stud Phys Chem Med 49:237-55). Cells transfected 
with an MPTEN may be stained with propidium iodide and evaluated in a flow cytometer 
(available from Becton Dickinson), which indicates accumulation of cells in different 
stages of the cell cycle. 

Accordingly, a cell proliferation or cell cycle assay system may comprise a cell that 
expresses an MPTEN, and that optionally has defective PTEN/IGF function (e.g. 
PTEN/IGF is over-expressed or under-expressed relative to wild-type cells). A test agent 
can be added to the assay system and changes in cell proliferation or cell cycle relative to 
controls where no test agent is added, identify candidate PTEN/IGF modulating agents. 
In some embodiments of the invention, the cell proliferation or cell cycle assay may be 
used as a secondary assay to test a candidate PTEN/IGF modulating agents that is 
initially identified using another assay system such as a cell-free assay system. A cell 
proliferation assay may also be used to test whether MPTEN function plays a direct role 
in cell proliferation or cell cycle. For example, a cell proliferation or cell cycle assay 
may be performed on cells that over- or under-express MPTEN relative to wild type cells. 
Differences in proliferation or cell cycle compared to wild type cells suggests that the 
MPTEN plays a direct role in cell proliferation or cell cycle. 

Angiogenesis. Angiogenesis may be assayed using various human endothelial cell 
systems, such as umbilical vein, coronary artery, or dermal cells. Suitable assays include 
Alamar Blue based assays (available from Biosource International) to measure 
proliferation; migration assays using fluorescent molecules, such as the use of Becton 
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Dickinson Falcon HTS FluoroBlock cell culture inserts to measure migration of cells 
through membranes in presence or absence of angiogenesis enhancer or suppressors; and 
tubule formation assays based on the formation of tubular structures by endothelial cells 
on Matrigel® (Becton Dickinson). Accordingly, an angiogenesis assay system may 
comprise a cell that expresses an MPTEN, and that optionally has defective PTEN/IGF 
function (e.g. PTEN/IGF is over-expressed or under-expressed relative to wild-type 
cells). A test agent can be added to the angiogenesis assay system and changes in 
angiogenesis relative to controls where no test agent is added, identify candidate 
PTEN/IGF modulating agents. In some embodiments of the invention, the angiogenesis 
assay may be used as a secondary assay to test a candidate PTEN/IGF modulating agents 
that is initially identified using another assay system. An angiogenesis assay may also be 
used to test whether MPTEN function plays a direct role in cell proliferation. For 
example, an angiogenesis assay may be performed on cells that over- or under-express 
MPTEN relative to wild type cells. Differences in angiogenesis compared to wild type 
cells suggests that the MPTEN plays a direct role in angiogenesis. U.S. Pat. Nos. 
5,976,782, 6,225,1 18 and 6,444,434, among others, describe various angiogenesis assays. 



Hypoxic induction. The alpha subunit of the transcription factor, hypoxia inducible 
factor-1 (HIF-i), is upregulated in tumor cells following exposure to hypoxia in vitro. 
Under hypoxic conditions, HIF-1 stimulates the expression of genes known to be 
important in tumour cell survival, such as those encoding glyolytic enzymes and VEGF. 
Induction of such genes by hypoxic conditions may be assayed by growing cells 
transfected with MPTEN in hypoxic conditions (such as with 0.1% 02, 5% C02, and 
balance N2, generated in a Napco 7001 incubator (Precision Scientific)) and normoxic 
conditions, followed by assessment of gene activity or expression by Taqman®. For 
example, a hypoxic induction assay system may comprise a cell that expresses an 
MPTEN, and that optionally has defective PTEN/IGF function (e.g. PTEN/IGF is over- 
expressed or under-expressed relative to wild-type cells). A test agent can be added to 
the hypoxic induction assay system and changes in hypoxic response relative to controls 
where no test agent is added, identify candidate PTEN/IGF modulating agents. In some 
embodiments of the invention, the hypoxic induction assay may be used as a secondary 
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assay to test a candidate PTEN/IGF modulating agents that is initially identified using 
another assay system. A hypoxic induction assay may also be used to test whether 
MPTEN Tunc lion plays a direct role in the hypoxic response. For example, a hypoxic 
induction assay may be performed on cells that over- or under-express MPTEN relative 
to wild type cells. Differences in hypoxic response compared to wild type cells suggests 
that the MPTEN plays a direct role in hypoxic induction. 

Cell adhesion. Cell adhesion assays measure adhesion of cells to purified adhesion 
proteins, or adhesion of cells to each other, in presence or absence of candidate 
modulating agents. Cell-protein adhesion assays measure the ability of agents to 
modulate the adhesion of cells to purified proteins. For example, recombinant proteins 
are produced, diluted to 2.5g/mL in PBS, and used to coat the wells of a microtiter plate. 
The wells used for negative control are not coated. Coated wells are then washed, 
blocked with \% BSA, and washed again. Compounds are diluted to 2x final test 
concentration and added to the blocked, coated wells. Cells are then added to the wells, 
and the unbound cells are washed off. Retained cells are labeled directly on the plate by 
adding a membrane-permeable fluorescent dye, such as calcein-AM, and the signal is 
quantified in a fluorescent microplate reader. 

Cell-cell adhesion assays measure the ability of agents to modulate binding of cell 
adhesion proteins with their native ligands. These assays use cells that naturally or 
recombinantly express the adhesion protein of choice. In an exemplary assay, cells 
expressing the cell adhesion protein are plated in wells of a multiwell plate. Cells 
expressing the ligand are labeled with a membrane-permeable fluorescent dye, such as 
BCECF , and allowed to adhere to the monolayers in the presence of candidate agents. 
Unbound cells are washed off, and bound cells are detected using a fluorescence plate 
reader. 

High-throughput cell adhesion assays have also been described. In one such assay, 
small molecule ligands and peptides are bound to the surface of microscope slides using a 
microarray spotter, intact cells are then contacted with the slides, and unbound cells are 
washed off. In this assay, not only the binding specificity of the peptides and modulators 
against cell lines are determined, but also the functional cell signaling of attached cells 
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using immunofluorescence techniques in situ on the microchip is measured (Falsey JR et 
al., Bioconjug Chem. 2001 May-Jun;12(3):346-53). 

Tubulogenesis. Tubulogenesis assays monitor the ability of cultured cells, 
generally endothelial cells, to form tubular structures on a matrix substrate, which 
generally simulates the environment of the extracellular matrix. Exemplary substrates 
include Matrigel™ (Becton Dickinson), an extract of basement membrane proteins 
containing laminin, collagen IV, and heparin sulfate proteoglycan, which is liquid at 4°C 
and forms a solid gel at 37° C. Other suitable matrices comprise extracellular 
components such as collagen, fibronectin, and/or fibrin. Cells arc stimulated with a pro- 
angiogenic stimulant, and their ability to form tubules is detected by imaging. Tubules 
can generally be detected after an overnight incubation with stimuli, but longer or shorter 
time frames may also be used. Tube formation assays arc well known in the art (e.g., 
Jones MK et al., 1999, Nature Medicine 5:1418-1423). These assays have traditionally 
involved stimulation with serum or with the growth factors FGF or VEGF. Serum 
represents an undefined source of growth factors. In a preferred embodiment, the assay is 
performed with cells cultured in serum free medium, in order to control which process or 
pathway a candidate agent modulates. Moreover, we have found that different target 
genes respond differently to stimulation with different pro-angiogenic agents, including 
inflammatory angiogenic factors such as TNF-alpa. Thus, in a further preferred 
embodiment, a tubulogenesis assay system comprises testing an MPTEN's response to a 
variety of factors, such as FGF, VEGF, phorbol myristate acetate (PMA), TNF-alpha, 
ephrin, etc. 

Cell Migration. An invasion/migration assay (also called a migration assay) tests the 
ability of cells to overcome a physical barrier and to migrate towards pro-angiogenic 
signals. Migration assays are known in the art (e.g., Paik JH et al., 2001, J Biol Chem 
276:1 1830-1 1837). In a typical experimental set-up, cultured endothelial cells are seeded 
onto a matrix-coated porous lamina, with pore sizes generally smaller than typical cell 
size. The matrix generally simulates the environment of the extracellular matrix, as 
described above. The lamina is typically a membrane, such as the transwell 



31 



61W70764 -£IS.»fl3 

EX03-029P 

polycarbonate membrane (Corning Costar Corporation, Cambridge, MA), and is 
generally part of an upper chamber that is in fluid contact with a lower chamber 
containing pro-angiogenic stimuli. Migration is generally assayed after an overnight 
incubation with stimuli, but longer or shorter time frames may also be used. Migration is 
assessed as the number of cells that crossed the lamina, and may be detected by staining 
cells with hemotoxylin solution (VWR Scientific, South San Francisco, CA), or by any 
other method for determining cell number. In another exemplary set up, cells arc 
fluorescently labeled and migration is detected using fluorescent readings, for instance 
using the Falcon HTS FluoroBlok (Becton Dickinson). While some migration is 
observed in the absence of stimulus, migration is greatly increased in response to pro- 
angiogenic factors. As described above, a preferred assay system for migration/invasion 
assays comprises testing an MPTEN's response to a variety of pro-angiogenic factors, 
including tumor angiogenic and inflammatory angiogenic agents, and culturing the cells 
in serum free medium. 

Sprouting assay. A sprouting assay is a three-dimensional in vitro angiogenesis 
assay that uses a cell-number defined spheroid aggregation of endothelial cells 
("spheroid"), embedded in a collagen gel-based matrix. The spheroid can serve as a 
starting point for the sprouting of capillary-like structures by invasion into the 
extracellular matrix (termed "cell sprouting") and the subsequent formation of complex 
anastomosing networks (Korff and Augustin, 1999, J Cell Sci 1 12:3249-58). In an 
exemplary experimental set-up, spheroids are prepared by pipetting 400 human umbilical 
vein endothelial cells into individual wells of a nonadhesive 96-well plates to allow 
overnight spheroidal aggregation (Korff and Augustin: J Cell Biol 143: 1341-52, 1998). 
Spheroids are harvested and seeded in 900/il of methocel-collagen solution and pipetted 
into individual wells of a 24 well plate to allow collagen gel polymerization. Test agents 
are added after 30 min by pipetting 100 fi\ of 10-fold concentrated working dilution of 
the test substances on top of the gel. Plates are incubated at 37°C for 24h. Dishes are 
fixed at the end of the experimental incubation period by addition of paraformaldehyde. 
Sprouting intensity of endothelial cells can be quantitated by an automated image 
analysis system to determine the cumulative sprout length per spheroid. 
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Primary assays for antibody modulators 

For antibody modulators, appropriate primary assays test is a binding assay that tests 
the antibody's affinity to and specificity for the MPTEN protein. Methods for testing 
antibody affinity and specificity are well known in the art (Harlow and Lane, 1988, 1999, 
supra). The enzyme-linked immunosorbant assay (ELLS A) is a preferred method for 
detecting MPTEN- specific antibodies; others include FACS assays, radioimmunoassays, 
and fluorescent assays. 

In some cases, screening assays described for small molecule modulators may also be 
used to test antibody modulators. 

Primary assays for nucleic acid modulators 
For nucleic acid modulators, primary assays may test the ability of the nucleic acid 
modulator to inhibit or enhance MPTEN gene expression, preferably mRNA expression. 
In general, expression analysis comprises comparing MPTEN expression in like 
populations of cells (e.g., two pools of cells that endogenously or recombinantly express 
MPTEN) in the presence and absence of the nucleic acid modulator. Methods for 
analyzing mRNA and protein expression are well known in the art. For instance, 
Northern blotting, slot blotting, ribonuclease protection, quantitative RT-PCR (e.g., using 
the TaqMan®, PE Applied Biosystems), or microarray analysis may be used to confirm 
that MPTEN mRNA expression is reduced in cells treated with the nucleic acid 
modulator (e.g., Current Protocols in Molecular Biology (1994) Ausubel FM et al, eds., 
John Wiley & Sons, Inc., chapter 4; Freeman WM et al„ Biotechniques (1999) 26:1 12- 
125; Kallioniemi OP, Ann Med 2001, 33:142-147; Blohm DH and Guiseppi-Elie, A Curr 
Opin Biotechnol 2001, 12:41-47). Protein expression may also be monitored. Proteins 
are most commonly detected with specific antibodies or antisera directed against either 
the MPTEN protein or specific peptides. A variety of means including Western blotting, 
ELISA, or in situ detection, are available (Harlow E and Lane D, 1988 and 1999, supra). 

In some cases, screening assays described for small molecule modulators, 
particularly in assay systems that involve MPTEN mRNA expression, may also be used 
to test nucleic acid modulators. 
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Secondary Assays 

Secondary assays may be used to further assess the activity of MPTEN-modulating 
agent identified by any of the above methods to confirm that the modulating agent affects 
MPTEN in a manner relevant to the PTEN/IGF pathway. As used herein, MPTEN- 
modulating agents encompass candidate clinical compounds or other agents derived from 
previously identified modulating agent. Secondary assays can also be used to test the 
activity of a modulating agent on a particular genetic or biochemical pathway or to test 
the specificity of the modulating agent's interaction with MPTEN. 

Secondary assays generally compare like populations of cells or animals {e.g., two 
pools of cells or animals that endogenously or recombinantly express MPTEN) in the 
presence and absence of the candidate modulator. In general, such assays test whether 
treatment of cells or animals with a candidate MPTEN-modulating agent results in 
changes in the PTEN/IGF pathway in comparison to untreated (or mock- or placebo- 
treated) cells or animals. Certain assays use "sensitized genetic backgrounds", which, as 
used herein, describe cells or animals engineered for altered expression of genes in the 
PTEN/IGF or interacting pathways. 

Cell-based assays 

Cell based assays may detect endogenous PTEN/IGF pathway activity or may rely on 
recombinant expression of PTENAGF pathway components. Any of the aforementioned 
assays may be used in this cell-based format. Candidate modulators are typically added 
to the cell media but may also be injected into cells or delivered by any other efficacious 
means. 

Animal Assays 

A variety of non-human animal models of normal or defective PTEN/IGF pathway 
may be used to test candidate MPTEN modulators. Models for defective PTEN/IGF 
pathway typically use genetically modified animals that have been engineered to mis- 
express (e.g., over-express or lack expression in) genes involved in the PTENAGF 
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pathway. Assays generally require systemic delivery of the candidate modulators, such 
as by oral administration, injection, etc. 

In a preferred embodiment, PTEN/IGF pathway activity is assessed by monitoring 
neovascularization and angiogenesis. Animal models with defective and normal 
PTEN/IGF are used to test the candidate modulator's affect on MPTEN in Matrigel® 
assays. Matrigel® is an extract of basement membrane proteins, and is composed 
primarily of laminin, collagen IV, and heparin sulfate proteoglycan. It is provided as a 
sterile liquid at 4°C, but rapidly forms a solid gel at 37° C. Liquid Matrigel® is mixed 
with various angiogenic agents, such as bFGF and VEGF, or with human tumor cells 
which over-express the MPTEN. The mixture is then injected subcutaneously(SC) into 
female alhymic nude mice (Taconic, Germantown, NY) to support an intense vascular 
response. Mice with Matrigel® pellets may be dosed via oral (PO), intraperitoneal (IP), 
or intravenous (TV) routes with the candidate modulator. Mice are euthanized 5-12 days 
post-injection, and the Matrigel® pellet is harvested for hemoglobin analysis (Sigma 
plasma hemoglobin kit). Hemoglobin content of the gel is found to correlate the degree 
of neovascularization in the gel. 

In another preferred embodiment, the effect of the candidate modulator on MPTEN is 
assessed via tumorigenicity assays. Tumor xenograft assays are known in the art (see, 
e.g., Ogawa K et ah, 2000, Oncogene 19:6043-6052). Xenografts are typically implanted 
SC into female athymic mice, 6-7 week old, as single cell suspensions either from a pre- 
existing tumor or from in vitro culture. The tumors which express the MPTEN t 
endogenously are injected in the flank, 1 x 10 5 to 1 x 10 7 cells per mouse in a volume of 
100 uL using a 27gauge needle. Mice are then ear tagged and tumors are measured twice 
weekly. Candidate modulator treatment is initiated on the day the mean tumor weight 
reaches 100 mg. Candidate modulator is delivered IV, SC, IP, or PO by bolus 
administration. Depending upon the pharmacokinetics of each unique candidate 
modulator, dosing can be performed multiple times per day. The tumor weight is 
assessed by measuring perpendicular diameters with a caliper and calculated by 
multiplying the measurements of diameters in two dimensions. At the end of the 
experiment, the excised tumors maybe utilized for biomarker identification or further 
analyses. For immunohistochemistry staining, xenograft tumors are fixed in 4% 
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paraformaldehyde, 0.1M phosphate, pH 7.2, for 6 hours at 4°C, immersed in 30% sucrose 
in PBS, and rapidly frozen in isopentane cooled with liquid nitrogen. 

In another preferred embodiment, tumorogenicity is monitored using a hollow 
fiber assay, which is described in U.S. Pat No. US 5,698,413. Briefly, the method 
comprises implanting into a laboratory animal a biocompatible, semi-permeable 
encapsulation device containing target cells, treating the laboratory animal with a 
candidate modulating agent, and evaluating the target cells for reaction to the candidate 
modulator. Implanted cells are generally human cells from a pre-existing tumor or a 
tumor cell line. After an appropriate period of time, generally around six days, the 
implanted samples are harvested for evaluation of the candidate modulator. 
Tumorogenicity and modulator efficacy may be evaluated by assaying the quantity of 
viable cells present in the macrocapsule, which can be determined by tests known in the 
art, for example, MTT dye conversion assay, neutral red dye uptake, trypan blue staining, 
viable cell counts, the number of colonies formed in soft agar, the capacity of the cells to 
recover and replicate in vitro, etc. 

In another preferred embodiment, a tumorogenicity assay use a transgenic 
animal, usually a mouse, carrying a dominant oncogene or tumor suppressor gene 
knockout under the control of tissue specific regulatory sequences; these assays are 
generally referred to as transgenic tumor assays. In a preferred application, tumor 
development in the transgenic model is well characterized or is controlled. In an 
exemplary model, the "RIPl-Tag2" transgene, comprising the SV40 large T-antigen 
oncogene under control of the insulin gene regulatory regions is expressed in pancreatic 
beta cells and results in islet cell carcinomas (Hanahan D, 1985, Nature 315: 1 15-122; 
Parangi S et al, 1996, Proc Natl Acad Sci USA 93: 2002-2007; Bergers G et al, 1999, 
Science 284:808-812). An "angiogenic switch," occurs at approximately five weeks, as 
normally quiescent capillaries in a subset of hyperproliferative islets become angiogenic. 
The RIP1-TAG2 mice die by age 14 weeks. Candidate modulators may be administered 
at a variety of stages, including just prior to the angiogenic switch (e.g., for a model of 
tumor prevention), during the growth of small tumors (e.g., for a model of intervention), 
or during the growth of large and/or invasive tumors (e.g., for a model of regression). 
Tumorogenicity and modulator efficacy can be evaluating life-span extension and/or 
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tumor characteristics, including number of tumors, tumor size, tumor morphology, vessel 
density, apoptotic index, etc. 

Diagnostic and therapeutic uses 

Specific MPTEN-modulating agents are useful in a variety of diagnostic and 
therapeutic applications where disease or disease prognosis is related to defects in the 
PTEN/1GF pathway, such as angiogenic, apoptotic, or cell proliferation disorders. 
Accordingly, the invention also provides methods for modulating the PTEN/IGF pathway 
in a cell, preferably a cell pre-determined to have defective or impaired PTEN/IGF 
function (e.g. due to overexpression, undercxpression, or misexpression of PTEN/IGF, or 
due to gene mutations), comprising the step of administering an agent to the cell that 
specifically modulates MPTEN activity. Preferably, the modulating agent produces a 
detectable phenotypic change in the cell indicating that the PTEN/IGF function is 
restored. The phrase "function is restored", and equivalents, as used herein, means that 
the desired phenotype is achieved, or is brought closer to normal compared to untreated 
cells. For example, with restored PTEN/IGF function, cell proliferation and/or 
progression through cell cycle may normalize, or be brought closer to normal relative to 
untreated cells. The invention also provides methods for treating disorders or disease 
associated with impaired PTEN/IGF function by administering a therapeutically effective 
amount of an MPTEN -modulating agent that modulates the PTEN/IGF pathway. The 
invention further provides methods for modulating MPTEN function in a cell, preferably 
a cell pre-determined to have defective or impaired MPTEN function, by administering 
an MPTEN -modulating agent. Additionally, the invention provides a method for 
treating disorders or disease associated with impaired MPTEN function by administering 
a therapeutically effective amount of an MPTEN -modulating agent. 

The discovery that MPTEN is implicated in PTEN/IGF pathway provides for a 
variety of methods that can be employed for the diagnostic and prognostic evaluation of 
diseases and disorders involving defects in the PTEN/IGF pathway and for the 
identification of subjects having a predisposition to such diseases and disorders. 

Various expression analysis methods can be used to diagnose whether MPTEN 
expression occurs in a particular sample, including Northern blotting, slot blotting, 
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ribonuclease protection, quantitative RT-PCR, and microarray analysis, (e.g., Current 
Protocols in Molecular Biology (1994) Ausubel FM et al., eds. t John Wiley & Sons, Inc., 
chapter 4; Freeman WM et a/., Biulechniques (1999) 26: 1 12-125; Kallioniemi OP, Ann 
Med 2001, 33:142-147; Blohm and Guiseppi-Elie, Curr Opin Biotechnol 2001, 12:41- 
47). Tissues having a disease or disorder implicating defective PTEN/IGF signaling that 
express an MPTEN, are identified as amenable to treatment with an MPTEN modulating 
agent. In a preferred application, the PTEN/IGF defective tissue overexprcsses an 
MPTEN relative to normal tissue. For example, a Northern blot analysis of mRNA from 
tumor and normal cell lines, or from tumor and matching normal tissue samples from the 
same patient, using full or partial MPTEN cDNA sequences as probes, can determine 
whether particular tumors express or overexpress MPTEN. Alternatively, the TaqMan® 
is used for quantitative RT-PCR analysis of MPTEN expression in cell lines, normal 
tissues and tumor samples (PE Applied Biosystems). 

Various other diagnostic methods may be performed, for example, utilizing reagents 
such as the MPTEN oligonucleotides, and antibodies directed against an MPTEN, as 
described above for: (1) the detection of the presence of MPTEN gene mutations, or the 
detection of either over- or under-expression of MPTEN mRNA relative to the non- 
disorder state; (2) the detection of either an over- or an under-abundance of MPTEN gene 
product relative to the non-disorder state; and (3) the detection of perturbations or 
abnormalities in the signal transduction pathway mediated by MPTEN. 

Thus, in a specific embodiment, the invention is drawn to a method for diagnosing a 
disease or disorder in a patient that is associated with alterations in MPTEN expression, 
the method comprising: a) obtaining a biological sample from the patient; b) contacting 
the sample with a probe for MPTEN expression; c) comparing results from step (b) with 
a control; and d) determining whether step (c) indicates a likelihood of the disease or 
disorder. Preferably, the disease is cancer. The probe may be either DNA or protein, 
including an antibody. 

EXAMPLES 

The following experimental section and examples are offered by way of illustration 
and not by way of limitation. 
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I. Drosophila PTEN/TGF screen 

RNA interference (RNAi) was used to create dPTEN-deficient cultured 
Drosophila cells (Schneider S2 cells (Schneider, 1. (1972) J. Embryol. Exp. Morph. 27, 
363), adapted to serum-free media, from Invitrogen Corp., Carlsbad, CA). Cells were 
treated for 3 days with dPTEN double stranded RNA (dsRNA) or a control dsRNA 
representing sequences from a Renilla luciferase cDNA. After a 3 day dsRNA 
pretreatment, 1 jxM bovine insulin was added to cells treated with dPTEN dsRNA to 
provide additional stimulation of the IGF/insulin pathway. PTEN-deficient, insulin- 
stimulated cells and control cells were plated in 384-well format and dsRNA representing 
approximately 6000 different Drosophila genes were added to individual wells. A cell 
proliferation assay (AqueousOne™ assay - Promega Corp, Madison, WI) was used to 
quantify cell viability after 96-hours incubation. For each of the greater than 6000 
dsRNA sequences tested in this manner, cell viability data was obtained on dPTEN- 
deficient, insulin-stimulated cells (insulin and dPTEN dsRNA-treated) and control cells 
(Renilla luciferase dsRN A -treated). Comparison of this data for each dsRNA identified 
dsRNA sequences that preferentially reduced the viability of insulin and dPTEN dsRNA 
treated cells. 

II. Analysis of Table 1 

BLAST analysis (Altschul et al., supra) was employed to identify orthologs of 
Drosophila modifiers. The columns "MPTEN symbol", and "MPTEN name aliases " 
provide a symbol and the known name abbreviations for the Targets, where available, 
from Genbank. "MPTEN RefSeq_NA or GI_NA'\ "MPTEN GL.AA", "MPTEN 
NAME", and "MPTEN Description" provide the reference DNA sequences for the 
MPTENs as available from National Center for Biology Information (NCBI), MPTEN 
protein Genbank identifier number (GI#), MPTEN name, and MPTEN description, all 
available from Genbank, respectively. The length of each amino acid is in the "MPTEN 
Protein Length" column. 
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Names and Protein sequences of Drosophila modifiers of PTEN/IGF from screen 
(Example I), are represented in the "Modifier Name" and "Modifier GI_AA" column by 
GI#, respectively. 



Table 1 



MPTEN 
symbol 


MPTEN 
name aliases 


MPTEN 
RefSeq_NA 
or GLNA 


MPTEN 
GI_AA 


MPTEN 
NAME 


MPTEN 
Description 


MPTEN 

Protein 

Length 


Modifier 
Name 


Modifier 
GI_AA 


CNOT7 


CNOT7 | 

CAF1 | 

hCAF-1 | 

DTG1 

binding 

factor 1 | 

carbon 

catabolite 

repressor 

protein 

(CCR4)- 

associative 

factor 1 1 

ccr4-not 

transcription 
complex, 
subunit 7 


NMJM3354 
|NMJ)5402 
6 


7019339 


CCR4-NOT 
transcription 
complex, 
subunit 7 


poly(A)- 

specific 

ribonuclease 

; signal 

transducer; 

transcription 

cofactor; 

transcription 

factor; 

transcription 

factor 


262 


CG5684 


21357541 


CNOT8 


CNOT8 | 
CAF1| 
POP2| 
CALIF | 
hCAFl | 
POP2 (yeast 
homolog) | 
PGK 
promoter 
directed over 
production | 
CCR4-NOT 
transcription 
complex, 
subunit 8 


NM_004779 


24496778 


CCR4-NOT 
transcription 
complex, 
subunit 8 


transcription 
factor 


292 


CG5684 


21357541 
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COPA 


COPA| 
HEP-COP | 
*enin | alpha 
:oat protein | 
:oatomer 
protein 
complex, 
subunit 
alpha 


NTM_004371 


4758030 


coatomcr 

protein 

complex, 

subunit 

alpha 


RNA 
binding; 
ntracellular 
transporter 


1224 


alphaCop 


24655452 


FASN 


FASN | FAS 
OA-5l9| 
MGC14367 | 
MGC 15706 | 
fatty acid 
synthase 


NMJXVW04 


21618359 


fatty acid 
synthase 


acyl-CoA 

thioesterase 

11; acyl 

carrier; 

enoyl-[acyl- 

carrier 

protein] 

reductase; 

fatty-acid 

synthase; 

fatty-acid 

synthase 


2509 


BcDNA:G 
H07626 


19920632 


HSPA9B 


HSPA9B | 
CSA | MOT 
|MOT2| 
GRP75| 
HSPA9 | 
PBP74| 
mot-2 | 
MGC4500) 
MTHSP75 | 
mthsp75 | 
p66- mortal in 
| mortal in, 
perinuclear | 
heat shock 
70kD 
protein 9 | 
peptide- 
binding 
protein 74 | 
75 kDa 
glucose 
regulated 
protein | 
stress-70 
protein, 
mitochondri 
al|heat 
shock 70kDa 
protein 9B 
( mortal in-2) 


NM_004134 


24234688 


heat shock 
70kDa 
protein 9B 
(mortalin-2) 


chaperone 


679 


Hsc70-5 


24653595 
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LOC3445 
10 


LOC344510 
similar to 
Stress-70 
protein, 
mitochondri 
al precursor 
(75 kDa 
glucose 
regulated 
protein) 
(GRP75) 
(Peptide- 
binding 
protein 74) 
[PBP74) 
[Mortalin) 
(MOT) | na 


XM_293066t 


19732424 1; 

1 


iimilar to 

Stress-70 

protein, 

mitochondri 

al precursor 

(75 kDa 

glucose 

regulated 

protein) 

(ORP75) 

[Peptide- 

binding 

protein 74) 

(PBP74) 

(Moitalin) 

(MOT) 


la 


529 


flsc70-5 


M653595 


KIAA100 

7 


KIAA1007 | 

adrenal 
gland 

protein AD- 
005 | 

KIAA1007 
protein 


NMJH6284 


20521724 


KIAA1007 
protein 


transcription 
regulator 


1835 


CGI 884 


24652 U2 


KIAA160 
4 


KIAAI604 | 

KIAA1604 

protein 


XM.034594 


10047283 


KIAAJ604 
protein 


na 


937 


CGI 2750 


24584968 


FKSG17 


FKSG17 


NM_032031 


14042953 


FKSG17 


nucleic acid 
binding; 
nucleic acid 
binding 


213 


Nacalpha 


19549608 


KIAA036 
3 


KIAA0363 | 

KIAA0363 

protein 


XM_ 166571 


2224667 


KIAA0363 
protein 


na 


1522 


Nacalpha 


19549608 


LOC1526 
82 


LOC152682 
| similar to 
nascent- 
po lypcpt ide- 
as so ciated 
complex 
alpha 

polypeptide 
[Homo 
sapiens] | na 


XM_087502 


18557511 


similar to 

nascent- 

polypeptide- 

associated 

complex 

alpha 

polypeptide 

[Homo 

sapiens] 


na 


215 


Nacalpha 


19549608 
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NACA 1 
r 

I 

i 
i 
i 
I 


^JACA | I 

lasccnt- 

x>lypcptidc- 

issociated 

;omplex 

ilpha 

polypeptide 


WL005594 1 


5031931 i 

I 

i 
( 


lascent- i 
>olypeptide- 1 
issociatcd 
complex 
ilpha 

xilypcptide 


lucleic acid 
rinding 


l\5 b 


slacalpha 


19549608 


PMPCB 


PMPCB | 

MPPB| 

MPP11 | 

MPPP52 | 

mitochondri 

al processing 

peptidase- 

beta | 

peptidase 

(mitochondri 

al 

processing) 
beta 


NMJXM279 


4758734 


peptidase 

mitochondri 

al 

processing) 
seta 


)eta- 

mitochondri 
al processing 
peptidase 


*89 


CG3731 


24646943 


UQCRC1 


UQCRC1 1 
D3S3191 | 
ubiquinol- 
cytochrome 
c reductase 
core protein 
I 


NM_003365 


4507841 


ubiquinoU 
cytochrome 
c reductase 
core protein 
I 


electron 
transporter; 
ubiquinol- 
cytochrome 
c reductase 


480 


CG3731 


24646943 


P0M121 


POM121| 

MGC3792 | 

KIAA0618) 

DKFZP586 

G1822| 

DKFZP586P 

2220 |P 145 

nuclear pore 

membrane 

protein 121 

kDa| 

nuclear 

envelope 

pore 

membrane 
protein POM 
121 1 
POM 121 
membrane 
glycoprotein 
(rat) 


NM_172020 


26051278 


POM121 
membrane 
glycoprotein 
(rat) 


na 


984 


CG14712 


21356627 



43 



EX03-029P 



PSMA2 



PSMA2 
MU | HC3 | 
PSC2| 
PMSA2| 
macropain 
subunil C3 | 
proteasome 
subunit HC3 
| proteasome 
component 
C3| 

multicatalyti 
c 

endopeptida 
se complex 
subunit C3 | 
proteasome 
(prosome, 
macropain) 
subunit, 
alpha type, 2 



NM 002787 4506181 



PSMA3 | 
HC8|PSC3 
MGC12306|2 
MGC32631 
macropain 
subunit C8 | 
proteasome 
subunit C8 | 
multicatalyti 
c 

endopeptida 
se complex 
subunit C8 | 
proteasome 
(prosome, 
macropain) 
subunit, 
alpha type, 3 



proteasome 
(prosome, 
macropain) 
subunit, 
alpha type, 2 



NM_002788 145061 83 
|NM_15213 



proteasome 
(prosome, 
macropain) 
subunit, 
alpha type, 3 



234 



Pros25 



255 



17737927 



Prosalpha 



24652204 



44 



EX03-029P 



fib :Q *i- 7* : Q y { & fib - :03S JL «-0 3 



PSMBl 



PSMB1 | 
HC5 | PSC5 
|PMSB1 | 
FU25321 | 
macropain 
subunit C5 | 
protcasomc 
gamma 
chain | 
proteasome 
subunit HC5 
| proteasome 
component 
C5| 

multicatalyti 
c 

endopeptida 
se complex 
subunit. CS | 
proteasome 
(prosome, 
macropain) 
subunit, beta 
type, 1 



NM.002793 



4506193 



proteasome peptidase 241 
(prosome, 
macropain) 
subunit, beta 
type, 1 



Pros26 



17737605 



LOC2218LOC221890 



XM_1 67259 20539647 



90 



| similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 



similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus! 



257 



Pros26.4 



24649446 



LOC2852LOC285213 



XM_2 10528 27479738 



13 



similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] | 
na 



similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 



241 



Pros26.4 



24649446 
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LOC3439 
28 

( 
i 


.OC343928 2 
similar to 
srotcase 
prosome, ! 
nacropain) 
Z6S subunit, 
ATPase 1 
[Rattus 
lorvegicusl | 
na 


KM_292848t 


J9727184 


similar to i 
protease 
prosome, 
■nacropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 


ia <■ 


U9 h 


>ros26.4 fa 


24649446 


LOC3456 
45 


LOC345645 
similar to 
protease 
[prosome, 
macropain) 
26S subunit. 
ATPase 1 
[Rattus 
norvegicus] | 
na 


XM_293923 


29736610 


similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 


na 


415 


Pro$26.4 


24649446 


LOC3459 
86 


LOC345986 

| similar to 

protease 

(prosome, 

macropain) 

26S subunit, 

ATPase 1 

[Rattus 

norvegicusj 

na 


XM.294034 


29740164 


similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 


na 


233 


Pros26.4 


24649446 


LOC3466 
82 


LOC346682 

| similar to 

protease 

(prosome, 

macropain) 

26S subunit, 

ATPase 1 

[Rattus 

norvegicus] 

na 


XM.294368 


29737532 


similar to 
protease 
(prosome, 
macropain) 
26S subunit, 
ATPase 1 
[Rattus 
norvegicus] 


na 


366 


Pros26.4 


24649446 
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PSMCl 



PSMC1 |S4 
| P 56| 
P26S4| 
MGC8541 ( 
MGC24583 
26S protease 
regulatory 
subunit 4 1 
proteasome 
26S subunit, 
ATPase, 1 | 
proteasome 
(prosome, 
macropain) 
26S subunit, 
ATPase, I 



NM 002802 24430151 



PSMD2 



PSMD2 1 S2 
P97| 
TRAP2 | 
MGC14274 | 
55.11 
protein | 
TNFR- 
associated 
protein 2 | 
26S 

proteasome 
subunit p97 | 
26S 

proteasome 
regulatory 
subunit S2 | 
26S 

proteasome 
non-ATPase 
regulatory 
subunit 2 | 
tumor 
necrosis 
factor 
receptor- 
associated 
protein 2 | 
proteasome 
(prosome, 
macropain) 
26S subunit, 
non-ATPase 
2 



NM_0O28O8 



25777602 



proteasome 
(prosome, 
macropain) 
26S subunit, 
non-ATPase, 



proteasome 


proteasome 


[prosome, 


ATPase; 


macropain) 


proteasome 


26S subunit, 


ATPase; 


ATPase, 1 


proteasome 




ATPase; 




proteasome 




ATPase; 




protein 




binding; 




poly- 




glutamine 




tract binding 



440 



Pros26.4 24649446 



protein 
binding; 
proteasome 
activator 



908 



Rpnl 



21356859 
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RANBP2|RANBP2| |NM_006267 
NUP358 | 
P270| 
nucleoporin 
358 | nuclear 
pore 
complex 
protein 
Nup358 | 
RAN 
binding 
protein 2 



6382079 Iran 



binding 
protein 2 



RAN protein 
binding; 
SUMO 
Hgase 



3224 



CGI 1856 24649980 



EL High-Throughput In Vitro Fluorescence Polarization Assay 
Fluorescently-labeled MPTEN peptide/substrate are added to each well of a 96- 
well microliter plate, along with a test agent in a test buffer (10 mM HEPES, 10 mM 
NaCl, 6 mM magnesium chloride, pH 7.6). Changes in fluorescence polarization, 
determined by using a Fluorolite FPM-2 Fluorescence Polarization Microtiter System 
(Dynatech Laboratories, Inc), relative to control values indicates the test compound is a 
candidate modifier of MPTEN activity. 

IV. High-Throughput In Vitro Binding Assay. 
33 P-labe1ed MPTEN peptide is added in an assay buffer (100 mM KCI, 20 mM 
HEPES pH 7.6, 1 mM MgCI 2 , 1% glycerol, 0.5% NP-40, 50 mM beta-mercaptoethanol, 1 
mg/ml BSA, cocktail of protease inhibitors) along with a test agent to the wells of a 
Neutralite-avidin coated assay plate and incubated at 25°C for 1 hour. Biotinylated 
substrate is then added to each well and incubated for 1 hour. Reactions are stopped by 
washing with PBS, and counted in a scintillation counter. Test agents that cause a 
difference in activity relative to control without test agent are identified as candidate 
PTEN/IGF modulating agents. 

V. Immunoprecipitations and Immunoblotting 

For coprecipitation of transfected proteins, 3 x 10 6 appropriate recombinant cells 
containing the MPTEN proteins are plated on 10-cm dishes and transfected on the 
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following day wilh expression conslructs. The total amount of DNA is kept constant in 
each iransfection by adding empty vector. After 24 h, cells are collected, washed once 
with phosphate-buffered saline and lysed for 20 min on ice in 1 ml of lysis buffer 
containing 50 mM Hepes, pH 7.9, 250 mM NaCl, 20 mM -glycerophosphate, 1 mM 
sodium orthovanadate, 5 mM p-nitrophenyl phosphate, 2 mM dithiothrcitol, protease 
inhibitors (complete, Roche Molecular Biochemicals), and 1% Nonidet P-40. Cellular 
debris is removed by centrifugation twice at 15,000 x g for 15 min. The cell lysate is 
incubated with 25 /il of M2 beads (Sigma) for 2 h at 4 °C with gentle rocking. 

After extensive washing with lysis buffer, proteins bound to the beads are 
solubilized by boiling in SDS sample buffer, fractionated by SDS-polyacrylamide gel 
electrophoresis, transferred to polyvinylidene difluoride membrane and blotted with the 
indicated antibodies. The reactive bands are visualized with horseradish peroxidase 
coupled to the appropriate secondary antibodies and the enhanced chemiluminescence 
(ECL) Western blotting detection system (Amersham Pharmacia Biotech). 

VI. Kinase assay 

A purified or partially purified MPTEN is diluted in a suitable reaction buffer, 
e.g.. 50 mM Hepes, pH 7.5, containing magnesium chloride or manganese chloride (1-20 
mM) and a peptide or polypeptide substrate, such as myelin basic protein or casein (1-10 
/ig/ml). The final concentration of the kinase is 1-20 nM. The enzyme reaction is 
conducted in microliter plates to facilitate optimization of reaction conditions by 
increasing assay throughput. A 96-well microliter plate is employed using a final volume 
30-100 pi. The reaction is initiated by the addition of 33 P-gamma-ATP (0.5 /xCi/ml) and 
incubated for 0.5 to 3 hours at room temperature. Negative controls are provided by the 
addition of EDTA, which chelates the divalent cation (Mg2 + or Mn 2+ ) required for 
enzymatic activity. Following the incubation, the enzyme reaction is quenched using 
EDTA. Samples of the reaction are transferred to a 96-well glass fiber filter plate 
(MultiScreen, Millipore). The filters are subsequently washed with phosphate-buffered 
saline, dilute phosphoric acid (0.5%) or other suitable medium to remove excess 
radiolabeled ATP. Scintillation cocktail is added to the filter plate and the incorporated 
radioactivity is quantitated by scintillation counting (Wallac/Perkin Elmer). Activity is 
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defined by the amount of radioactivity detected following subtraction of the negative 
control reaction value (EDTA quench). 

VII. Expression analysis 

All cell lines used in the following experiments arc NCI (National Cancer 
Institute) lines, and are available from ATCC (American Type Culture Collection, 
Manassas, VA 201 10-2209). Normal and tumor tissues are obtained from Impath, UC 
Davis, Clontech, Stratagene, Ardais, Genome Collaborative, and Ambion. 

TaqMan analysis is used to assess expression levels of the disclosed genes in 
various samples. 

RNA is extracted from each tissue sample using Qiagen (Valencia, CA) RNeasy 
kits, following manufacturer's protocols, to a final concentration of 50ng/uJ. Single 
stranded cDNA is then synthesized by reverse transcribing the RNA samples using 
random hexamers and 500ng of total RNA per reaction, following protocol 4304965 of 
Applied Biosystems (Foster City, CA). 

Primers for expression analysis using TaqMan assay (Applied Biosystems, Foster 
City, CA) are prepared according to the TaqMan protocols, and the following criteria: a) 
primer pairs are designed to span introns to eliminate genomic contamination, and b) 
each primer pair produced only one product. Expression analysis is performed using a 
7900HT instrument. 

Taqman reactions are carried out following manufacturer's protocols, in 25 jil 
total volume for 96-well plates and 10 jil total volume for 384-well plates, using 300nM 
primer and 250 nM probe, and approximately 25ng of cDNA. The standard curve for 
result analysis is prepared using a universal pool of human cDNA samples, which is a 
mixture of cDNAs from a wide variety of tissues so that the chance that a target will be 
present in appreciable amounts is good. The raw data are normalized using 18S rRNA 
(universally expressed in all tissues and cells). 

For each expression analysis, tumor tissue samples are compared with matched 
normal tissues from the same patient. A gene is considered overexpressed in a tumor 
when the level of expression of the gene is 2 fold or higher in the tumor compared with 
its matched normal sample. In cases where normal tissue is not available, a universal 
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poo) of cDNA samples is used instead. In these cases, a gene is considered 
ovcrexpressed in a tumor sample when the difference of expression levels between a 
tumor sample and the average of all normal samples from the same tissue type is greater 
than 2 times the standard deviation of all normal samples (i.e., Tumor - averagc(all 
normal samples) > 2 x STDEV(all normal samples) ). 

A modulator identified by an assay described herein can be further validated for 
therapeutic effect by administration to a tumor in which the gene is overcxpressed. A 
decrease in tumor growth confirms therapeutic utility of the modulator. Prior to treating 
a patient with the modulator, the likelihood that the patient will respond to treatment can 
be diagnosed by obtaining a tumor sample from the patient, and assaying for expression 
of the gene targeted by the modulator. The expression data for the gene(s) can also be 
used as a diagnostic marker for disease progression. The assay can be performed by 
expression analysis as described above, by antibody directed to the gene target, or by any 
other available detection method. 
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WHAT IS CLAIMED IS: 

1. A method of identifying a candidate PTEN/IGF pathway modulating agent, said 
method comprising the steps of: 

(a) providing an assay system comprising a MPTEN polypeptide or nucleic acid; 

(b) contacting the assay system with a test agent under conditions whereby, but 
for the presence of the test agent, the system provides a reference activity; and 

(c) detecting a test agent-biased activity of the assay system, wherein a difference 
between the test agent-biased activity and the reference activity identifies the test agent as 
a candidate PTEN/IGF pathway modulating agent. 

2. The method of Claim 1 wherein the assay system comprises cultured cells that express 
the MPTEN polypeptide. 

3. The method of Claim 2 wherein the cultured cells additionally have defective 
PTEN/IGF function. 

4. The method of Claim 1 wherein the assay system includes a screening assay 
comprising a MPTEN polypeptide, and the candidate test agent is a small molecule 
modulator. 

5. The method of Claim 4 wherein the assay is a binding assay. 

6. The method of Claim 1 wherein the assay system is selected from the group 
consisting of an apoptosis assay system, a cell proliferation assay system, an angiogenesis 
assay system, and a hypoxic induction assay system. 

7. The method of Claim 1 wherein the assay system includes a binding assay comprising 
a MPTEN polypeptide and the candidate test agent is an antibody. 
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8. The method of Claim 1 wherein the assay system includes an expression assay 
comprising a MPTEN nucleic acid and the candidate test agent is a nucleic acid 
modulator, 

9. The method of claim 8 wherein the nucleic acid modulator is an antisense 
oligomer. 

10. The method of Claim 8 wherein the nucleic acid modulator is a PMO. 

11. The method of Claim 1 additionally comprising: 

(d) administering the candidate PTEN/1GF pathway modulating agent identified 
in (c) to a model system comprising cells defective in PTEN/IGF function and, detecting 
a phenotypic change in the model system that indicates that the PTEN/IGF function is 
restored. 

12. The method of Claim 1 1 wherein the model system is a mouse model with defective 
PTEN/IGF function. 

13. A method for modulating a PTEN/IGF pathway of a cell comprising 
contacting a cell defective in PTEN/IGF function with a candidate modulator that 
specifically binds to a MPTEN polypeptide, whereby PTEN/IGF function is restored. 

14. The method of claim 13 wherein the candidate modulator is administered to a 
vertebrate animal predetermined to have a disease or disorder resulting from a defect in 
PTEN/IGF function. 

15. The method of Claim 13 wherein the candidate modulator is selected from the 
group consisting of an antibody and a small molecule. 

16. The method of Claim 1, comprising the additional steps of: 
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(d) providing a secondary assay system comprising cultured cells or a non- 
human animal expressing MPTEN , 

(e) contacting the secondary assay system with the test agent of (b) or an 
agent derived therefrom under conditions whereby, but for the presence of the test agent 
or agent derived therefrom, the system provides a reference activity; and 

(0 detecting an agent-biased activity of the second assay system, 
wherein a difference between the agent-biased activity and the reference activity 

of the second assay system confirms the test agent or agent derived therefrom as a 

candidate PTEN/IGF pathway modulating agent, 

and wherein the second assay detects an agent-biased change in the PTEN/IGF 

pathway. 

17. The method of Claim 16 wherein the secondary assay system comprises cultured 
cells. 

18. The method of Claim 16 wherein the secondary assay system comprises a non- 
human animal. 

19. The method of Claim 18 wherein the non-human animal mis^expresses a PTEN/IGF 
pathway gene. 

20. A method of modulating PTEN/IGF pathway in a mammalian cell comprising 
contacting the cell with an agent that specifically binds a MPTEN polypeptide or nucleic 
acid. 

21. The method of Claim 20 wherein the agent is administered to a mammalian animal 
predetermined to have a pathology associated with the PTEN/IGF pathway. 

22. The method of Claim 20 wherein the agent is a small molecule modulator, a 
nucleic acid modulator, or an antibody. 

23. A method for diagnosing a disease in a patient comprising: 
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(a) obtaining a biological sample from the patient; 

(b) contacting the sample with a probe for MPTEN expression; 

(c) comparing results from step (b) with a control; 

(d) determining whether step (c) indicates a likelihood of disease. 

24. The method of claim 23 wherein said disease is cancer. 
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ABSTRACT OF THE DISCLOSURE 

Human MPTEN genes are identified as modulators of the PTEN/1GF pathway, 
and thus are therapeutic targets for disorders associated with defective PTEN/IGF 
function. Methods for identifying modulators of PTEN/IGF, comprising screening for 
agents that modulate the activity of MPTEN are provided 
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Nucleic Acid and Polypeptide sequences 

>gi | 17978498 | ref |NM_013354 .3 | Homo sapiens CCR4-N0T transcription 
complex, subunit 7 (CN0T7), transcript variant 1, mRNA 

GGCACGAGGGCAGCACGGTGCGGCGCAGCTCCTGCTCGCCTTTCCCTTCGCTGGGCGAGAGGTGTCTATG 
GGGCACCCGCTGCCGCCGCCGCTACCGCCACCGCCACCGCCACCGCCGCCGAGTGCTGTCTCTATGGCGA 
GGAGGAGWAGGAGGAGCGCGAGCTCAGCGACACAAGTACATAAATAAAGGATAAAATATTTTATGAAACA 
AATC TTC AATC AAGT ATAAC ATTTTGATGCTTGGC ATCT AGAC TC CCTTGTGC CC TC AC TATGC CAGCGG 
CAACTGTAGATCATAGCCAAAGAATTTGTGAAGTTTGGGCTTGCAACTTGGAT^ 

TCGTC AAGTTATCCG AAAATATAATT AC GTTGC TATGGAC AC C G AGTTTC CAGGTGTGGTTGCAAG AC C C 

ATTGGAGAATTCAGGAGCAATGCTGACTATCAATACCAACTATTGCGGTGTAATGTAGACTTGTTAAAGA 

TAATTCAGCTAGGACTGACATTTATGAATGAGCAAGGAGAATACCCTCCAGGAACTTCAACTTGGCAGTT 

TAATTTTAAATTT AATTTG ACGG AGG AC ATGTATGCC C AGGACTCTAT AG AGC TACTAAC AAC ATCTGGT 

ATCCAGTTTAAAAAACATGAGGAGGAAGGAATTGAAACCCAGTACTTTGCAGAACTTCTTATGACTTCTG 

GAGTGGTCCTCTGTGAAGGGGTCAAATGGTTGTCATTTCATAGCGGTTACGACTTTGGCTACTTAATCAA 

AATCCTAACCAACTCTAACTTGCCTGAAGAAGAACTTGACTTCTTTGAGATCCTTCGATTGTTTTTTCCT 

GTC ATTTATGATGTGAAGT ACC TC ATG AAG AGC TGC AAAAATCTC AAAGGTGG ATTAC AGG AGGTGGC AG 

AACAGTTAGAGCTGGAACGGATAGGACCACAACATCAGGCAGGATCTGATTCATTGCTCACAGGAATGGC 

CTTTTTCAAAATGAGAGAAATGTTCTTTGAAGATCATATTGATGATGCCAAATATTGTGGTCATTTC 

GGCCTTGGTTC TGGTTC ATC C T ATGTAC AGAATGGC AC AGGGAATGC AT ATG AAGAGGAAGCC AAC AAGC 

AGTC ATGAC ATGAAAT AGTCC TTTT ATTTTT ATTTC G AGC TAC AC AC ATGCTTGTAT AT AGGTTTTATCT 

CTGGTTGAATCCCTCGAACAATAGACAGTACCTTTCCCCCCCCTTTCATGGCCCATTTTATTGTCTGCCT 

TTC AGTAC TAAGT ATG AC C GTTCCT ATCTC AGATCTTAATAAAAAG AAAAAAAAAAACGC ATTC AGGTTA 

AATTTGGCCTTAATTTAATATACTTGTTAGCAAGCGTGTGTGACAGAGAGTCGGGAAAGCTACATCATTG 

AAT ATTTTGAT AAAC TTT ACC G ACTTGAGTTTGGTTTATTTTTCC C TTTTCCTAAATTAAC T AGC ACTGA 

CTGTAATTTATTTCCCTGTTTCACGTCTCTCCCTTCCATTCTGCAGGAGTTT'rAGCTATTTGAGATCGTG 

GACCATCAGTTTTGCACTTTAGAGAGTGTTTCTGACTCTAAACCTGTTTTATCAGAAAATTTGTTTTTTC 

TTGATCTTAGCTGGAAAAATCTGCCAACTTTACACAGTATTTACTTGGTTTTGACCCACAGAATATAGCA 

CGTTGTGC AAAC TGTCG ATTC AGC GAAACTTAAAAAAG AC AAG AAACT AC TG AGG AGCTTAGTAAC TGC T 

GTTTCTGT ACGTAGTGTTTAATCTTCC AAGC AC ATCTAGTGTCTGTC AGTTTC TAATTGGCATGTGTAGG 

CTGCTCTGTGACTGAAGAATTTTCAAACCAGCTTTACACCCTTCAGGAAAAATCCCTTGTGATTC 

TTAC TATCTGC CAGG AAACTGGT ACTC AAGATGTTGAAGC T AC AGTT ATTTTATG ATAGC AC ACTTCCC T 

TGATCTGCTTATTTTTATTCCATCACCATTTACCCTTTTTTTTTTTTTAAATTTTGT 

ATGCTCTTGATTTGTTGGTTACACAAATCAATTTTATTAAAAATCCAAAGATAAGTCTTTAGGTATATTT 

TGTACCAAATTAAATTAGAAGACAAAAATTGTGCTTTCATAGTTGCTACAAAGGTAAATA^ 

TTGGTACAAAACAACAAAATATATATATATTCTCATATATATATATATAGCTGATAAAATTACCTGAGGA 

GTGTAATGCTTATTTTTTTTGTGTATATCTTTGCAATCTATTTTATATATATTGACAAAAGAGACTGTC 

AATAC TT AGCC ATGC AG AAT ATGTGAC CAGACC AG AGC ATGTGT AGG AAG ACTTTACGGT AATC ATTAAC 

TCT AC CC CG AAATG ATGG AC TAC AAGTT AT AATGTGTGTT ACCTAC ACTTC AATC AGTAAT ATTAGC AAA 

TCTCCAAATGTTAGTCACATTGGTTTGTCTCCCTTGTACATTCTTTATTCATGATATTACAGTGCTGTAA 

AAATAAAGGC TTC AAAATGTTTTC TTAT AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

>gi | 24496777 | ref |NM_004779 . 3 | Homo sapiens CCR4-N0T transcription 
complex, subunit 8 (CN0T8 } , mRNA 

GCACGAGGGGGTAGAGGGAAAAGAGCTCCGGGCCAGGGGCTGCCGTCGCCGCCGTCGGGGAGTCAGCCCG 
CCAGCCCGCCAGCTCGTCAGCCCGCCACCAGCTTCGCGGGCCCTGTCGGTCCCGGTAAGCGGGCCTGCGC 
TT ACC G GAAAG AG G AGC GT AAG ATG AAAG AGTATC AGAC C AAAC ATTGTC TGGC TTGC A C TG T AAAAC T A 
GTT AGCTG AAGAC GACTTC TC AGGTTTC TTCAGG ATGCCTGC AGC ACTTGTGG AG AATAGCC AGGTTATC 
TGTGAAGTGTGGGCCAGTAATCTAGAAGAAGAGATGAGGAAGATCCGAGAAATCGTGCCCAGTTACAGTT 
AT ATTGCC ATGG AC AC AGAATTTC C AGGTGTTGTGGTGCGACC AATTGGTG AATTTCGT AGTTC CAT AG A 
TT ACC AATATC AGCTTC TGC GGTGC AATGTTGACC TTTTAAAAATTATCC AGC TGGGC CTTAC ATTC AC A 
AATG AG AAGGG AG AGT ATCC TTCTGG AATC AAT ACTTGGC AGTTC AATTTC AAATTT AAC CTTAC AG AG G 
ACATGTACTCCCAGGATTCCATAGATCTCCTTGCTAACTCAGGACTACAGTTTCAGAAGCATGAAGAGGA 
AGGGATTGACACACTGCACTTTGCAGAGCTGCTTATGACATCAGGAGTGGTTCTCTGTGACAATGTCAAA 
TGGCTTTCATTTCATAGTGGCTATGATTTTGGCTATATGGTAAAGTTGCTTACAGATTCTCGTTTGCCAG 
AAG AG G AAC ATG AATTCTT AC AT ATTCTG AACC TTTTC TC CCC ATC C ATTT ATG ATGTGAAAT AC CTGAT 
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GAAGAGCTGCAAAAATCTTAAGGGAGGTCTTCAGGAAGTTGCTGATCAGTTGGATTTGCAGAGGATTGGA 

AGGCAGCACCAGGCAGGCTCAGACTCACTG^TGACAGGAATGGCTTTCTTTAGGATGAAAGAGTTGTTTC 

TTGAGGACAGCATTGATGATGCCAAGTACTGTGGGCGGCTCTATGGCTTAGGCACAGGAGTGGCCCAGAA 

GCAGAATGAGGATGTGGACTCTGCCCAGGAGAAGATGAGCATCCTGGCGATTATCAACAACATGCAGCAG 

TGATGGCGCCAGGCTCTGCAGGGTGGGCCTGATCCCAGAGTGGTGCTTACTGTGCTGACTGTGTACTTAT 

CTTCCCCAAGAGAAAATGCTTCTTTTGAGCACACTGTACCTACCATCTGCATTGAGCAGAAAGACTTTTG 

TTTTACTGAAG AC AAAAGATGTTTTT ATTTT AGACC C AGAAG AG AGG AGTTTGC TC TG AATTTGTAAAT A 

AGTCTTCCCCATTCCTCATACTCGAGCCTCTCCTCTCTGGTTGCCTCCTGCCACCAGCATCCATGGCTCA 

TTTGACACCTTTTTAAATATCAGGACAAGTCTGAAACAAAGTAGTAAAATGTATATAACTCTTACCT<3TT 

GTCATTCTTTTTCTTTTAAATTTGTTGCTAATCTCTGATAATGAAGATTCTTACTCTGATTCTCAGCTGA 

GCTGTGAGGGCTTCCAGGGAAAATGGAACAAAATGGTGTTCTTAGGTAATGGGTTGTAGATACTGAGTCT 

TCC TTTCCTTTTCTGAC C C TTCTC GAGGAC ATTTGC TTTC C TC AC AC TTTTGTAGTC TC TC TTT AC ATAT 

T AC TATATGGAAATGAATTGC TC TGTGC TG AAATTTG AAGACC AG AT AATGAAAC TG AAAAGC AAAC AAT 

TTTTACTG AATC TGTCTAC CT^TC ATTC ATGAG AACTC C AGAATGAGTGTTGACC ACTGAAGC ATCTTTT A 

AGTCTGTGTTCCATTGTGCCATTCAGGTTTGCTGTCACATATGCATCATCTGAAATCA 

GTACAATAAAATATCCTGGATTTGATCCTGAAGGAAACTAGTAAGATCAGATTTTTGGGTCATGTCTGTT 

GTATTTTCAGTAATGTGATTTCAGATGGTCATCTGGATTCTCCCACTTCTCTACTCCATTATTTCTCTAC 

TTTTCCTTCCAGCAAACCTTGAAACGTGAGGGAGATGGATTAATGTGAGTAACAGGAATGTC 

AAGCTAGAGTGGTTACATTTAATCAGGCAGTAAGATAATTTGGGTTCTTGAGTTGTTTTGGAGTAATATC 

CCACAACTGGGGTAGGAAGCTCAGGACTTTTTTCTTTAAAGCTAGTCATTTCAAAAGCATATTGTATTT 

TTTGAATGACTACAGTATGGACAATTTCAAAAACCAAAACCCACTTTGGATTGGTGGAAGTAAAAACTC 

TAACTCACTC AAGTGAATGAATGGTCTTGCATTTT AAAAGC TTATGGGAAACTCAATT^ 

AAAATGTCAAGTATTATAAGCTGGTATTTAAGATGCTTGTAAATACTATTTATGTTTTTA^ 

ATAAAGATTTC TTTTTAAAAAAAAAAAAAAAAAAAAAAA 

>gi | 6996002 |ref|NM_00437 1.2 | Homo sapiens coatomer protein complex, 
subunit alpha (COPA) , mRNA 

G AG AAGGGGACCTTC AGGTC C AGGC AAAGGGGG AAC TTCTGTCGTGGG AAC GAAAAAG AAAGAGG ATTTA 
CAGGGTGGGGGGACAGAGGGGCAGCAGGAACCAGAAGGGAGACAGTGGCGGTCGCACCGGGGCCGATCCG 
AGAGTTCCCCTTAGAGAACGGAGCTCACGGGCGGGGAGGCCTCACCTGCTAGTAGGACGCAGAAAGACAG 
AAGGCGAAGGAGACCCCCTGCCGTAGCCATCTTGCCTCTCTGCTGAGCGGAAGCCCCCGTTCGGCTCCTG 



CGTGTTTCCAGGCGCTTCCGGAAACCGCGGGAGAGGGTCGCTGACGTGGAGGCGTCCGAAGGGCAGCAGG 
GTGTGTCGGGGC TC GGATTAAGAC ATCGG AGTCGGAGACCTGAGAGATGTTAACCAAATTCGAGAC CAAG 
AGCGCGCGGGTCAAAGGGCTCAGCTTTCACCCCAAAAGACCTTGGATCCTGACTAGTTTACATAATGGGG 
TCATCCAGTTATGGGACTATCGGATGTGCACTCTCATTGACAAGTTTGATC 

AGGCATTGACTTCCATAAGCAGCAGCCACTGTTCGTCTCTGGAGGAGATGACTATAAGATTAAGGTTTGG 
AATT AC AAGC TTCGGCGCTGTCTTTTCAC ATTGCTTGGGC ACTTAGATTATATTC GC AC C AC GTTTTTTC 
ATC ATG AAT ATC CC TGG ATTC TG AGTGCCTCCGATG ATC AG ACC ATCCGAGTGTGG AATTGGC AATCTAG 
AAC C TGTGTTTGTGTGTTAAC AGGGC AC AACC ATTATGTGATGTGTGCTC AGTTCC ACCCC AC AGAAG AC 
TTGGTAGTATCAGCCAGCCTGGACCAGACTGTGCGCGTTTGGGATATTTCTGGTCTGAGGAAAAAAAACC 
TGTCCCCTGGTGCGGTGGAATCGGATGTGAGAGGAATAACTGGGGTTGATCTATTTGGAACTACAGATGC 
AGTGGTGAAGCATGTACTAGAGGGTCACGATCGTGGAGTAAACTGGGCTGCCTTCCACCCCACTATGCCC 
CTTATTGTATCTGGGGCAGATGATCGTCAAGTGAAGATCTGGCGCATGAATGAATCAAAGGCATGGGAGG 
TTGATACCTGCCGGGGCCATTACAACAATGTATCTTGTGCCGTCTTCCACCCTCGCCAAGAGTTGATCCT 
C AGC AATTC TG AGG AC AAGAGT ATTC GAGTC TGGG AT ATGTCTAAGCGGAC TGGGGTTC AG ACTTTC CGC 
AG AGAC CATGATC GTTTCTGGGTCC TAGCTGC TC ACC CT AAC CTTAAC CTC TTTGC AGC AGGCC ATG ATG 
GTGGTATGATTGTGTTTAAGCTGGAACGGGAACGGCCAGCCTATGCTGTTCATGGCAATATGCTACACTA 
TGTCAAGGACCGATTCTTACGACAGCTGGATTTCAACAGCTCCAAAGATGTAGCTGTGATGCAGTTGCGG 
AGTGGTTC C AAGTTTCC AGT ATTC AATATGTC ATAC AATCC AGC AG AAAATGC AGTCC TGCTTTG 
GAGCT AGC AATCTAG AG AAT AGTACCTATGACC TGT AC ACC ATC CCTAAAGATGCTG ACTCC CAGAATC C 
TGATGC GCCTGAAGGGAAACGATCCTC AGGC CTGAC AGC CGTTTGGGTCGCTCGAAATCGGTTTGCTGTC 
CTAGATCGGATGCATTCGCTTCTGATCAAGAATCTGAAGAATGAGATCACCAAAAAGGTACAGGTGCCCA 
ACTGTGATGAGATCTTCTATGCTGGCACAGGCAATCTCCTGCTTCGAGATGCGGACTCTATCACACTCTT 
TGACGTACAGCAGAAGCGGACTCTGGCATCTGTGAAGATTTCTAAAGTGAAATACGTTATCTGGTCAGCA 
G AC ATGTC ACATGTAGCACT AC T AGCC AAAC ACGCC ATTGTG ATCTGTAACCGC AAACTGG ATGCTTT AT 
GTAACATTCATGAGAACATTCGTGTCAAGAGTGGGGCCTGGGATGAGAGTGGGGTATT.TATCTATACCAC 
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AAGC AAC C AC ATC AAATATGCTGTC ACC AC TGGGGAC CACGGG ATC ATTC GAACTCTGGATTT ACCC ATC 

TATGTCACACGGGTGAAGGGCAACAATGTATACTGCCTAGACAGGGAGTGTCGTCCCCGGGTACTCACCA 

TTGATCCCACTGAGTTCAAATTCAAGCTGGCCCTGATCAACAGAAAATATGATGAGGTACTGCACATGGT 

GAGGAATGCCAAACTAGTTGGCCAGTCTATTATTGCTTATCTCCAGAAGAAGGGCTATCCTGAAGTGGCA 

CTGCATTTTGTCAAGGATGAGAAAACTCGC1TTAGTCTGGCACTGGAGTGTGGAAACATTGAGATTGCTC 

TGGAAGCAGCCAAAGCACTGGATGACAAGAACTGCTGGGAAAAGCTGGGAGAAGTGGCCCTGCTGCAGGG 

GAACCACCAGATTGTGGAAATGTGCTATCAGCGTACCAAAAACTTTGACAAAGTTTCCTTCCTGTATCTT 

ATC ACTGGC AAC TTAG AAAAAC TTCGC AAG ATG ATG AAGATTGCTG AG ATC AG AAAGG AC ATG AGTGGCC 

AC TATC AG AATGC C CTAT AC CTGGGTG ATGTGTCAGAGC GTGTGCGG ATCC TG AAG AACTGTGGACAG AA 

GTCC CTGGCC T ATCTCAC AGCTGC TACCCATGGCTTAGATGAAGAAGCTGAGAGCCT AAAGG AGACATTT 

GACC C AG AGAAGG AGAC AATCCC AGAC ATTGAC CC TAATGCC AAGCTGC TCC AGCC AC C TGC ACCT ATC A 

TG C C ATTGG AT AC C AAT TGGCC TTT ATTG AC TGT AT C C AAAG G AT TTTTTG AAG G C AC C ATTGC C AGC AA 

AGGGAAGGG AGGAGC AC TGGCTGCTGAC ATTGAC ATTGAC AC TGTTGGT AC AG AGGGCTGGGGAGAGG AT 

GCAGAGCTGCAGTTGGATGAAGATGGGTTTGTGGAGGCTACAGAAGGTTTGGGGGATGATGCTCTTGGCA 

AGGG AC AGG AAGAAGG AGGTGGCTGGGATGTAGAAG AAGATC TGG AGCTCC C TCCTGAGCTGGATATATC 

CCCTGGGGCAGCTGGTGGGGCTGAAGATGGTTTCTTTGTGCCCCCAACCAAGGGAACAAGTCCAACTCAG 

ATCTGGTGTAATAACTCTCAGCTTCCAGTTGATCACATCCTGGCAGGCTCTTTCGAAACAGCCATGCGGC 

TC C TTC ATG AC CAAGTAGGGGTAATCC AGTTTGGCCCCTAC AAGC AACTGTTCCTAC AGAC AT ACGCCCG 

AGGCCGCACAACCTATCAGGCTCTGCCCTGCCTACCCTCCATGTATGGCTATCCTAATCGCAACTGGAAG 

GATGC AGGGCTGAAGAATGGTGT AC C AGCTGTGGGCCTG AAGCTTAATG ACCTC ATCCAACGGTTGC AGC 

TGTGCTACCACCTCACCACAGTTGGCAAATTTGAGGAGGCTGTGGAAAAATTCCGTTCCATCCTTCTCAG 

TGTGCCACTTCTTGTTGTGGACAATAAACAAGAGATTGCAGAGGCCCAGCAGCTCATCACCATTTGCCGT 

GAGTACATTGTGGGTTTGTCCGTGGAGACAGAAAGGAAGAAGCTGCCCAAAGAGACTCTAGAACAGCAGA 

AGC GC ATC TGTG AG ATGGC AGCC T AT TTCAC C C AC TC AAAC C TGC AGC C TG TGC AC AT G ATC C T G GT GC T 

GCGTACAGCCCTCAATCTGTTCTTCAAGCTCAAGAACTTCAAGACAGCTGCCACCTTTGCTCGGCGCCTA 

CTAG AACTCGGGCC C AAGCCTGAGGTGGCC C AAC AGACC CGAAAAATC CTGTC TGC C TGTGAG AAGAATC 

CC AC AG ATGCCTAC C AGCTC AATTATGAC ATGC AC AACC CCTTTGACATTTGTGC TGC ATC AT ATCGGC C 

C ATCT ACCGTGGAAAGCCAGT AGAAAAGTGTCC ACTC AGTGGGGCCTGC T ATTCC C CTGAGTTC AAAGGT 

C AAATCTGC AGGGTC AC CAC AGTG AC AG AGATTGGC AAAGATGTGATTGGTTTAAGGATC AGTCC TCTGC 

AGTTTCGC TAAGGCCCC CTTTGTGTGCATGGGTC AGTC ACCATATGTTC CC CC C AG AG AATGTGTCT ATA 

TCCTCCTTCTAACAGCACCTTCCCCCTGCAGCTACTCTTCAGATCTGGCTCTCTGTACCCTAAAACCTAG 

TATC TTTTTCTCTTC T ATGG AAAATC CGAAGGTCT AAACTTG ACTTTTTTGAGGTCTTCTC AACTTG ACT 

ACAGTTGTGCTCATAATTGTCCTTGCCTTTCCAGCTTAATTATTTTAAGGAACAAATGAAAACTCTGGGC 

TCGGTGGAGTGGCTCATACCTGTAATCCCAGCACTTTGGGAGGCTACGGTGGGCAGATCATCTGAGGCCA 

GGAGTTCGAGACCTGCCTGGCCAACATGGCAACACCCCGTCTCTAATAAAAATATAAAAATTAGCCTGGC 

ATGGTAGCATGCGCCTATAGTCCCAGCTGCTCAGGAGGCTGAGGCATGAGAATCGCTTGAACCTAGGAGG 

TGGAGGTTGC ATTC AACTGAGATC AT ACC AC TTC ATTCC AGCCTGGGTG AC AGAGC AAGAC TCTGTC TCA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGAAAAC TC TGTGATGG AC ATTTGTTT AGT AAATC CC 

TTCAGTATTTATC C CTCC TTTCCC C AC AGC AGCTTTCTTTCC TGTCAACTAG AAAGG AGC AGG ATGT AAT 

AAATACATTTTGGTGTGACTAGGCCACACCAACTCTTAATCATCTCCCATTTTCCTTAGACATTTAAATT 

TCAAGGCAGGTACCCTCTGTGTACTCAGAAATTTGAAGAAGTTATTTGGTTTTCCAAAATGCACACTGCG 

GGTTATTGATTTGTTCTTT AC AACTATTGTTC TC AT ATTTCTC AC AC TAAAT AAATC TC TATG AGAGC TT 

CTTGAAAAAAAAAAAAAAAAAGCG 

>gi |21618358|ref |NM_004104.3 | Homo sapiens fatty acid synthase (FASN) , 
mRNA 

GAGGGAGCCAGAGAGACGGCAGCGGCCCCGGCCTCCCTCTCCGCCGCGCTTCAGCCTCCCGCTCCGCCGC 
GCTCCAGCCTCGCTCTCCGCCGCCCGCACCGCCGCCCGCGCCCTCACCAGAGCAGCCATGGAGGAGGTGG 
TGATTGCCGGCATGTCCGGGAAGCTGCCAGAGTCGGAGAACTTGCAGGAGTTCTGGGACAACCTCATCGG 
CGGTGTGGACATGGTCACGGACGATGACCGTCGCTGGAAGGCGGGGCTCTACGGCCTGCCCCGGCGGTCC 
GGCAAGCTGAAGGACCTGTCTAGGTTTGATGCCTCCTTCTTCGGAGTCCACCCCAAGCAGGCACACACGA 
TGGACCCTCAGCTGCGGCTGCTGCTGGAAGTCACCTATGAAGCCATCGTGGACGGAGGCATCAACCCAGA 
TTCACTCCGAGGAACACACACTGGCGTCTGGGTGGGCGTGAGCGGCTCTGAGACCTCGGAGGCCCTGAGC 
CGAGACCCCGAGACACTCGTGGGCTACAGCATGGTGGGCTGCCAGCGAGCGATGATGGCCAACCGGCTCT 
CCTTCTTCTTCGACTTCAGAGGGCCCAGCATCGCACTGGACACAGCCTGCTCCTCCAGCCTGATGGCCCT 
GCAGAACGCCTACCAGGCCATCCACAGCGGGCAGTGCCCTGCCGCCATCGTGGGGGGCATCAATGTCCTG 
CTGAAGCCCAACACCTCCGTGCAGTTCTTGAGGCTGGGGATGCTCAGCCCCGAGGGCACCTGCAAGGCCT 
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TCGACACAGCGGGGAATG^TACTGCCGCTCGGAGGGTGTGGTGGCCGTCCTGCTGACCAAGAAGTCCCT 
GGCCCGGCGGGTGTACGCCACCATCCTGAACGCCGGCACCAATACAGATGGCTTCAAGGAGCAAGGCGTG 
ACCTTCCCCTCAGGGGATATCCAGGAGCAGCTCATCCGCTCGTTGTACCAGTCGGCCGGAGTGGCCCCTG 
AGTCATTTGAATACATCGAAGCCCACGGCACAGGCACCAAGGTGGGCGACCCCCAGGAGCTGAATGGCAT 
CACCCGAGCCCTGTGCGCCACCCGCCAGGAGCCGCTGCTCATCGGCTCCACCAAGTCCAACATGGGGCAC 
CCGGAGCCAGCCTCGGGGCTGGCAGCCCTGGCCAAGGTGCTGCTGTCCCTGGAGCACGGGCTCTGGGCCC 
CCAACCTGCACTTCCATAGCCCCAACCCTGAGATCCCAGCGCTGTTGGATGGGCGGCTGCAGGTGGTGGA 
CCAGCCCCTGCCCGTCCGTGGCGGCAACGTGGGCATCAACTCCTTTGGCTTCGGGGGCTCCAACGTGCAC 
ATCATCCTGAGGCCCAACACGCAGCCGCCCCCCGCACCCGCCCCACATGCCACCCTGCCCCGTCTGCTGC 
GGGCCAGCGGACGCACCCCTGAGGCCGTGCAGAAGCTGCTGGAGCAGGGCCTCCGGCACAGCCAGGACCT 
GGCTTTCCTGAGCATGCTGAACGACATCGCGGCTGTCCCCGCCACCGCCATGCCCTTCCGTGGCTACGCT 
GTGCTGGGTGGTGAGCGCGGTGGCCCAGAGGTGCAGCAGGTGCCCGCTGGCGAGCGCCCGCTCTGGTTCA 
TCTGCTCTGGGATGGGCACACAGTGGCGCGGGATGGGGCTGAGCCTCATGCGCCTGGACCGCTTCCGAGA 
TTCC ATCCTACGCTC CGATGAGGCTGTG AAGCC ATTC GGCCTG AAGGTGTCAC AGCTGC TGC TGAGC AC A 
GACG AGAGC ACC TTTG ATGAC ATCGTC C ATTC GTTTGTG AGCCTG ACTGCC ATCC AGATAGGCC TC ATAG 
. ACCTGCTGAGCTGCATGGGGCTGAGGCCAGATGGCATCGTCGGCCACTCCCTGGGGGAGGTGGCCTGTGG 
CTACGCCGACGGCTGCCTGTCCCAGGAGGAGGCCGTCCTCGCTGCCTACTGGAGGGGACAGTGCATCAAA 
GAAGCCCATCTCCCGCCGGGCGCCATGGCAGCCGTGGGCTTGTCCTGGGAGGAGTGTAAACAGCGCTGCC 
CCCCGGCGGTGGTGCCCGCCTGCCACAACTCCAAGGACACAGTCACCATCTCGGGACCTCAGGCCCCGGT 
GTTTGAGTTCGTGGAGCAGCTGAGGAAGGAGGGTGTGTTTGCCAAGGAGGTGCGGACCGGCGGTATGGCC 
TTC C AC TC CTACTTC ATGGAGGC C ATCGC ACCCCC ACTGCTGC AGG AGC TC AAG AAGGTG ATC CGGG AGC 
CGAAGCCACGTTCAGCCCGCTGGCTCAGCACCTCTATCCCCGAGGCCCAGTGGCACAGCAGCCTGGCACG 
CACGTCCTCCGCCGAGTACAATGTCAACAACCTGGTGAGCCCTGTGCTGTTCCAGGAGGCCCTGTGGCAC 
GTGCCTGAGCACGCGGTGGTGCTGGAGATCGCGCCCCACGCCCTGCTGCAGGCTGTCCTGAAGCGTGGCC 
TGAAGCCGAGCTGCACCATCATCCCCCTGATGAAGAAGGATCACAGGGACAACCTGGAGTTCTTCCTGGC 
CGGCATCCGGAGGCTGCACCTCTCAGGCATCGACGCCAACCCCAATGCCTTGTTCCCACCTGTGGAGTTC 
CCAGCTCCCCGAGGAACTCCCCTCATCTCCCCACTCATCAAGTGGGACCACAGCCTGGCCTGGGACGTGC 
CGGCCGCCGAGGACTTCCCCAACGGTTCAGGTTCCCCCTCAGCCGCCATCTACAACATCGACACCAGCTC 
CGAGTCTCCTGACCACTACCTGGTGGACCACACCCTCGACGGTCGCGTCCTCTTCCCCGCCACTGGCTAC 

AGGATGTGGTGCTGCACCAGGCCACCATCCTGCCCAAGACTGGGACAGTGTCCCTGGAGGTACGGCTCCT 

GGAGGCCTCCCGTGCCTTCGAGGTGTCAGAGAACGGCAACCTGGTAGTGAGTGGGAAGGTGTACCAGTGG 

GATGACCCTGACCCCAGGCTCTTCGACCACCCGGAAAGCCCCACCCCCAACCCCACGGAGCCCCTCTTCC 

TGGC CC AGGCTGAAGTTTAC AAGGAGCTGCGTCTG CGTGGCT ACGACTACGGC C CTC ATTTC CAGGGCAT 

CCTGGAGGCCAGCCTGGAAGGTGACTCGGGGAGGCTGCTGTGGAAGGATAACTGGGTGAGCTTCATGGAC 

ACCATGCTGCAGATGTCCATCCTGGGCTCGGCCAAGCACGGCCTGTACCTGCCCACCCGTGTCACCGCCA 

TC C AC ATCGAC CC TGCC ACCC AC AGGC AG AAGC TGT AC AC ACTGC AGGAC AAGGCCC AAGTGGC TGACGT 

GGTGGTGAGCAGGTGGCTGAGGGTCACAGTGGCCGGAGGCGTCCACATCTCCGGGCTCCACACTGAGTCG 

GCCCCGCGGCGGCAGCAGGAGCAGCAGGTGCCCATCCTGGAGAAGTTTTGCTTCACTCCCCACACGGAGG 

AGGGGTGCCTGTCTGAGCGCGCTGCCCTGCAGGAGGAGCTGCAACTGTGCAAGGGGCTGGTGCAGGCACT 

GCAGACCAAGGTGACCCAGCAGGGGCTGAAGATGGTGGTGCCCGGACTGGATGGGGCCCAGATCCCCCGG 

GACCCCTCACAGCAGGAACTGCCCCGGCTGTTGTCGGCTGCCTGCAGGCTTCAGCTCAACGGGAACCTGC 

AGCTGGAGCTGGCGCAGGTGCTGGCCCAGGAGAGGCCCAAGCTGCCAGAGGACCCTCTGCTCAGCGGCCT 

CCTGGACTCCCCGGCACTCAAGGCCTGCCTGGACACTGCCGTGGAGAACATGCCCAGCCTGAAGATGAAG 

GTGGTGGAGGTGCTGGCCGGCCACGGTCACCTGTATTCCCGCATCCCAGGCCTGCTCAGCCCCCATCCCC 

TGCTGCAGCTGAGCTACACGGCCACCGACCGCCACCCCCAGGCCCTGGAGGCTGCCCAGGCCGAGCTGCA 

GCAGCACGACGTTGCCCAGGGCCAGTGGGATCCCGCAGACCCTGCCCCCAGCGCCCTGGGCAGCGCCGAC 

CTCCTGGTGTGCAACTGTGCTGTGGCTGCCCTCGGGGACCCGGCCTCAGCTCTCAGCAACATGGTGGCTG 

CCCTGAGAGAAGGGGGCTTTCTGCTCCTGCACACACTGCTCCGGGGGCACCCCTCGGGACATGTGGCCTT 

CCTCACCTCCACTGAGCCGCAGTATGGCCAGGGCATCCTGAGCCAGGACGCGTGGGAGAGCCTCTTCTCC 

AGGGTGTCCGTGCGCCTGGTGGGCCTCAAGAAGTCCTTCTACGGCTCCACGCTCTTCCTGTGCCGCCGGC 

CCACCCCGCAGGACAGCCCCATCTTCCTGCCGGTGGACGATACCAGCTTCCGCTGGGTGGAGTCTCTGAA 

GGGCATCCTGGCTGACGAAGACTCTTCCCGGCCTGTGTGGCTGAAGGCCATCAACTGTGCCACCTCGGGC 

GTGGTGGGCTTGGTGAACTGTCTCCGCCGAGAGCCCGGCGGAACGCTCCGGTGTGtGCTGCTCTCCAACC 

TCAGCAGCACCTCCCACGTCCCGGAGGTGGACCCGGGCTCCGCAGAACTGCAGAAGGTGTTGCAGGGAGA 

CCTGGTGATGAACGTCTACCGCGACGGGGCCTGGGKXX^TTTCCGCCACTTCCTGCTGGAGGAGGACAAG 

CCTGAGGAGCCGACGGCACATGCCTTTGTGAGCACCCTCACCCGGGGGGACCTGTCCTCCATCCGCTGGG 
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TCTGCTCCTCGCTGCGCCATGCCCAGCCCACCTGCCCTGGCGCCCAGCTCTGCACGGTCTACTACGCCTC 

CCTCAACTTCCGCGACATCATGCTGGCCACTGGCAAGCTGTCCCCTGATGCCATCCCAGGGAAGTGGACC 

TCCCAGGACAGCCTGCTAGGTATGGAGTTCTCGGGCCGAGACGCCAGCGGCAAGCGTGTGATGGGACTGG 

TGCCTGCCAAGGGCCTGGCCACCTCTGTCCTGCTGTCACCGGACTTCCTCTGGGATGTGCCTTCCAACTG 

GACGCTGGAGGAGGCGGCCTCGGTGCCTGTCGTCTACAGCACGGCCTACTACGCGCTGGTGGTGCGTGGG 

CGGGTGCGCCCCGGGGAGACGCTGCTCATCCACTCGGGCTCGGGCGGCGTGGGCCAGGCCGCCATCGCCA 

TCGCCCTCAGTCTGGGCTGCCGCGTCTTCACCACCGTGGGGTCGGCTGAGAAGCGGGCGTACCTCCAGGC 

CAGGTTCCCCCAGCTCGACAGCACCAGCTTCGCCAACTCCCGGGACACATCCTTCGAGCAGCATGTGCTG 

TGGCACACGGGCGGGAAGGGCGTTGACCTGGTCTTGAACTCCTTGGCGGAAGAGAAGCTGCAGGCCAGCG 

TGAGGTGCTTGGCTACGCACGGTCGCTTCCTGGAAATTGGCAAATTCGACCTTTCTCAGAACCACCCGCT 

CGGCATGGCTATCTTCCTGAAGAACGTGACATTCCACGGGGTCCTACTGGATGCGTTCTTCAACGAGAGC 

AGTGCTGACTGGCGGGAGGTGTGGGCGCTTGTGCAGGCCGGCATCCGGGATGGGGTGGTACGGCCCCTCA 

AGTGCACGGTGTTCCATGGGGCCCAGGTGGAGGACGCCTTCCGCTACATGGCCCAAGGGAAGCACATTGG 

CAAAGTCGTCGTGCAGGTGCTTGCGGAGGAGCCGGAGGCAGTGCTGAAGGGGGCCAAACCCAAGCTGATG 

TCGGCCATCTCCAAGACCTTCTGCCCGGCCCACAAGAGCTACATCATCGCTGGTGGTCTGGGTGGCTTCG 

GCCTGGAGTTGGCGCAGTGGCTGATACAGCGTGGGGTGCAGAAGCTCGTGTTGACTTCTCGCTCCGGGAT 

CCGGACAGGCTACCAGGCCAAGCAGGTCCGCCGGTGGAGGCGCCAGGGCGTACAGGTGCAGGTGTCCACC 

AGC AACATCAGCTC ACTGGAGGGGGCC CGGGGC C TC ATTGCCG AGGCGGC GC AGC TTGGGCCCGTGGGC G 

GCGTCTTCAACCTGGCCGTGGTCTTGAGAGATGGCTTGCTGGAGAACCAGACCCCAGAGTTCTTCCAGGA 

CGTCTGCAAGCCCAAGTACAGCGGCACCCTGAACCTGGACAGGGTGACCCGAGAGGCGTGGCCTGAGCTG 

GACTACTTTGTGGTCTTCTCCTCTGTGAGCTGCGGGCGTGGCAATGCGGGACAGAGCAACTACGGCTTTG 

CCAATTCCGCCATGGAGCGTATCTGTGAGAAACGCCGGCACGAAGGCCTCCCAGGCCTGGCCGTGCAGTG 

GGGCGCCATCGGCGACGTGGGCATTTTGGTGGAGACGATGAGCACCAACGACACGATCGTCAGTGGCACG 

CTGCCCCAGCGCATGGCGTCCTGCCTGGAGGTGCTGGACCTCTTCCTGAACCAGCCCCACATGGTCCTGA 

GCAGCTTTGTGCTGGCTGAGAAGGCTGCGGCCTATAGGGACAGGGACAGCCAGCGGGACCTGGTGGAGGC 

CGTGGCACACATCCTGGGCATCCGCGACTTGGCTGCTGTCAACCTGGACAGCTCACTGGCGGACCTGGGC 

CTGG AC TCGCTC ATGAGC GTGGAGGTGCGCC AG AC GCTGGAGCGTGAGC TC AAC CTGGTGC TGTCC GTGC 

GCG AGGTGC GGC AACTC AC GC TC CGGAAACTGC AGG AGCTGTCC TC AAAGGCGG ATGAGGC C AGCGAGCT 

GGCATGCCCCACGCCCAAGGAGGATGGTCTGGCCCAGCAGCAGACTCAGCTGAACCTGCGCTCCCTGCTG 

GTGAACCCGGAGGGCCCCACCCTGATGCGGCTCAACTCCGTGCAGAGCTCGGAGCGGCCCCTGTTCCTGG 

TGCACCCAATCGAGGGCTCCACCACCGTGTTCCACAGCCTGGCCTCCCGGCTCAGCATCCCCACCTATGG 

CCTGCAGTGCACCCGAGCTGCGCCCCTTGACAGCATCCACAGCCTGGCTGCCTACTACATCGACTGCATC 

AGGC AGGTGC AGC CCGAGGGCCCCT AC CGCGTGGCCGGCTACTCCTACGGGGCCTGCGTGGCCTTTG AAA 

TGTGCTCCCAGCTGCAGGCCCAGCAGAGCCCAGCCCCCACCCACAACAGCCTCTTCCTGTTCGACGGCTC 

GCCCACCTACGTACTGGCCTACACCCAGAGCTACCGGGCAAAGCTGACCCCAGGCTGTGAGGCTGAGGCT 

GAG AC GG AGGCC ATATGCTTCTTC GTGC AGC AGTTC AC GG AC ATGG AGCAC AAC AGGGTGC TGG AGGCGC 

TGC TGCCGCTGAAGGGCCTAGAGGAGC GTGTGGC AGCC GCCGTGG ACC TG ATC ATC AAGAGCC AC C AGGG 

CCTGGACCGCCAGGAGCTGAGCTTTGCGGCCCGGTCCTTCTACTACAAGCTGCGTGCCGCTGAGCAGTAC 

ACACCCAAGGCCAAGTACCATGGCAACGTGATGCTACTGCGCGCCAAGACGGGTGGCGCCTACGGCGAGG 

ACCTGGGCGCGGACTACAACCTCTCCCAGGTATGCGACGGGAAAGTATCCGTCCACGTCATCGAGGGTGA 

CCACCGCACGCTGCTGGAGGGCAGCGGCCTGGAGTCCATCATCAGCATCATCCACAGCTCCCTGGCTGAG 

CCACGCGTGAGCGTGCGGGAGGGCTAGGCCCGTGCCCCCGCCTGCCACCGGAGGTCACTCCACCATCCCC 

ACCCCACCCCACCCCACCCCCGCCATGCAACGGGATTGAAGGGTCCTGCCGGTGGGACCCTGTCCGGCCC 

AGTGCCACTGCCCCCCGAGGCTGCTAGACGTAGGTGTTAGGCATGTCCCACCCACCCGCCGCCTCCCACG 

GCACCTCGGGGACACCAGAGCTGCCGACTTGGAGACTCCTGGTCTGTGAAGAGCCGGTGGTGCCCGTGCC 

CGCAGGAACTGGGCTGGGCCTCGTGCGCCCGTGGGGTCTGCGCTTGGTCTTTCTGTGCTTGGATTTGCAT 

ATTTATTGCATTGCTGGTAGAGACCCCCAGGCCTGTCCACCCTGCCAAGACTCCTCAGGCAGCGTGTGGG 

TCCCGCACTCTGCCCCCATTTCCCCGATGTCCCCTGCGGGCGCGGGCAGCCACCCAAGCCTGCTGGCTGC 

GGCCCCCTCTCGGCCAGGCATTGGCTCAGCCCGCTGAGTGGGGGGTCGTGGGCCAGTCCCCGAGGAGCTG 

GGCCCCTGCACAGGCACACAGGGCCCGGCCACACCCAGCGGCCCCCCGCACAGCCACCCGTGGGGTGCTG 

CCCTTATGCCCGGCGCCGGGCACCAACTCCATGTTTGGTGTTTGTCTGTGTTTGTTTTTCAAGAAATGAT 

TC AAATTGCTGCTTGGATTTTGAAATTTACTGTAACTGTC AGTGTAC ACGTC TGG ACCCC GTTTC ATTTT 

TACACCAATTTGGTAAAAATGCTGCTCTCAGCCTCCCACAATTAAACCGCATGTGATCTCC 

>gi | 24234687 | ref |NM_004134 . 3 | Homo sapiens heat shock 70kDa protein 9B 
(mortalin-2) (HSPA9B) , nuclear gene encoding mitochondrial protein, 
mRNA 
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GGAGGTTTCC AG AAGCGCTGCC GC C ACC GC ATC GC GC AGC TC TTTGC CGTC GG AGCGC TTGTTTGC TGCC 
TCGTACTCCTCCATTTATCCGCCATGATAAGTGCCAGCCGAGCTGCAGCAGCCCGTCTCGTGGGCGCCGC 
AGCCTC CCGGGGCC C TACGGCC GC CC GCC ACC AGGATAGCTGGAATGGC CTT AGTC ATG AGGCTTTTAGA 
C TTGTTTC AAGGC GGG ATT ATGC ATC AGAAGCAATC AAGGGAGC AGTTGTTGGT ATTGATTTGGGTACT A 
CCAACTCCTGCGTGGCAGTTATGGAAGGTAAACAAGCAAAGGTGCTGGAGAATGCCGAAGGTGCCAGAAC 
C AC C CC TTCAG TTGTGG C CTTT AC AGC AGATGGTGAGC GACTTGTTGG AATGCCGGCCAAGCGACAGGC T 
GTCACCAACCCAAACAATACATTTTATGCTACCAAGCGTCTCATTGGCCGGCGATATGATGATCCTGAAG 
TACAGAAAGACATTAAAAATGTTCCCTTTAAAATTGTC 

TCATGGGAAATTGTATTCTCCGAGTCAGATTGGAGCATTTGTGTTGATGAAGATGAAAGAGACTGCAGAA 
AATT AC TTGGGGC AC AC AGC AAAAAATGC TGTGATC AC AGTCC CAGCTTATTTC AATGACTCGC AG AGAC 
AGGCCACTAAAGATGCTGGCCAGATATCTGGACTGAATGTGCTTCGGGT^ 

TGCTCTTGC CT ATGGTCTAGAC AAATC AGAAGAC AAAGTCATTGCTGTAT ATGATTT AGGTGGTGGAAC T 
TTTGAT ATTTCT ATCC TGGAAATTC AG AAAGG AGT ATTTGAGGTGAAATC C AC AAATGGGG AT AC CTTC T 
T AGGTGGGG AAGAC TTTGACC AGGCCTTGCT ACGGC AC ATTGTG AAGGAGTTC AAGAG AG AG AC AGGGGT 
TGATTTGACTAAAGACAACATGGCACTTCAGAGGGTACGGGAAGCTGCTGAAAAGGCTAAATGTCAACTC 
TCC TC ATCTGTGCAGAC TGAC ATC AATTTGCCC T ATCTTAC AATGG ATTCTTC TGG ACCC AAGCATTTGA 
ATATCAAGTTGACCCGTGCTCAATTTGAAGGGATTGT^ 

C CAAAAAGC T ATGC AAG ATGC AG AAG TC AG C AAGAGTG AC AT AGG AG AAGTG ATTC TTGTGGGTGG C ATG 
ACTAGGATGCCCAAGGTTCAGCAGACTGTACAGGATCTTTTTGGCAGAGCCCCAAGTAAAGCTGTCAATC 
CTGATGAGGCTGTGGCCATTGGAGCTGCCATTCAGGGAGGTGTGTTGGCCGGCGATGTCACGGATGTGCT 
GCTCCTTGATGTCACTCCCCTGTCTCTGGGTATTGAAACTCTAGGAGGTGTCTTTACCAAACTTATTAAT 
AGGAATACCACTATTCCAACCAAGAAGAGCCAGGTATTCTCTACTGCCGCTGATGGTCAAACGCAAGTGG 
AAATTAAAGTGTGTC AGGGTGAAAGAGAG ATGGC TGGAGAC AAC AAAC TC CTTGG AC AGTTTAC TTTGAT 
TGG AATTCC ACCAGCC CC TCGTGGAGTTC CTC AG ATTG AAGTT AC ATTTG AC ATTGATGCC AATGGG AT A 
GTAC ATGTTTC TGCTAAAGATAAAGGC AC AGGAC GTG AGC AGC AG ATTGTAATC C AGTCTTC TGGTGGAT 
TAAGCAAAGATGATATTGAAAATATGGTTAAAAATGCAGAGAAATATGCTGAAGAAGACCGGCGAAAGAA 
GGAACGAGTTGAAGCAGTTAATATGGCTGAAGGAATCATTCACGACACAGAAACCAAGATGGAAGAATTC 
AAGGACCAATTACCTCCTGATGAGTGCAACAAGCTGAAAGAAGAGATTTCCAAAATCAGGGAGCTCCTGG 
C T AG AAAAG AC AGCGAAAC AGGAGAAAATATT AG AC AGGCAGC ATCCTCTCTTC AGC AGGCATC ACTGAA 
GCTGTTCGAAATGGCATACAAAAAGATGGCATCTGAGCGAGAAGGCTCTGGAAGTTCTGGCACTGGGGAA 
C AAAAGGAAG ATC AAAAGGAGG AAAAAC AGT AAT AAT AGC AG AAATTTTGAAGCC AGAAGGAC AAC ATAT 
GAAGCTTAGGAGTGAAGAGACTTCCTGAGCAGAAATGGGCGAACTTCAGTCTTTTTACTGTGTTTTTGCA 

GTATTCTATATATAATTTCCTTAATTTC^ 

G ATTC T AAC AGTATAAAGTTC AC AATATTCTATGTCC CT AGC CTGTC ATTTTTC AGCTGC ATGT AAAAGG 
AGGTAGGATGAATTGATCATTATAAAGATTTAACTATTTTATGC TG AAGTG AC CATATTTTCAAGGGGTC 
AAACC ATC TC GC AC AC AGC AATGAAGGTAGTC ATCCATAG AC TTGAAATG AG ACC AC ATATGGGGATG AG 
ATCCTTCTAGTTAGCCTAGTACTGCTGTACTGGCCTGTATGTACATGGGGTCCTTCAACTGAGGCCTT^ 
AAGTCAAGCTGGCTGTCCCATGTTTGTAGATGGGGCAGAGGAATCTAGAACAATGGGAAACTTAGCTATT 
TATATTAGGTACAGCTATTAAAACAAGGTAGGAATGAGGCTAGACCT^ 

TCTAGCTACCTTCTGCCCTGTGTCTGGCACCTACATCCTTGATCATTGTTCTCTTACCCATTCTGGAAT 
TTTTTTTTTTTAAATAAATACAGAAAGCATCTTGAAAAAAAAAAAAAAAAAA 

>gi | 30150447 | ref | XM_293066 . 2 | Homo sapiens similar to Stress-70 
protein, mitochondrial precursor (75 kDa glucose regulated protein) 

(GRP 75) (Peptide-binding protein 74) (PBP74) (Mortalin) (MOT) 

{LOC344510) , mRNA 

TCCATTTCGCAGAAATAAGAAGGAGAATAAAACTTAGAGAAAAAGAGGAGGCCAAAAGGGGCAAGGTAGT 

GGCT ACTTGC AGAAAC C TG AAAAAT ATGTGTG AC AAAGC CGCC ATAATAAGTGCC AGCCG AGCTGC AGC A 
GCCCGTCTCAGGGGCACCGCAGCCTCCCCGGGCCCTACGGCCGCCCGCCAGCAGGATGGCTGGAATGGCC 
TTAGTCATGAGGCTTTTAGAATTGTTTCAAGGCAGGATTATGCGTCAGAAGCAATCAATGGAGCAGCTGG 
TGGTGTTTATTTGGGTACTACCAACTCCTGTGTGGCAGTTATGGAAGGTAAACAAGCAAAGGTGCTGGAG 
AATGCCGAAGGTGCCAGAACTACCCCTTCAGTTGTGGCCTTTACAGCAGATGTACAGAAAGACATTAAAA 
ATATTCCCTTTAAAATTGTCTGTGCCTCCAATGGTGATGCCTGGGTTGAGGCTCATGGGAAACTGTATTC 
TCCAAGTCAGATCGGAGCATTTGTGTCGATGAAGATGAAAGAGACTGCAGAAAATTACTTGGGGCACACA 
GC AAAAAATGC TGTG ATCGC AGTC CC AGC TTATTTC AATG AC TTGC AGAGGC AGGCC AC TAAAG ATGC TG 
GCC AGATATC TGG ACTGAATGTGC TTCGGGTGATTAATG AACC C AC AGC TGC TGC TC TTGC C T ATGGTC T 
AGACAAATCAAAAGACAAAGTCATTGCTGTATATGATTTAGGTGGTCGAACTTTTGATATTT 
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G AATTTC AGAAAGG AGT ATTC G AGGTG AAATCC AC AAATGGGGAC AC TTTCTTAGGTGGGGAAG ACTTTG 
ACCAGGCCTTGCTACAGCACATTGTGAAGGAGTTCAAGAGAGAGACGGGGGTTGATTTGACCAAAGACAA 
CGTGGCATTTCAGAGGGTGTGGGAAGCTGCTGAAAAGGCTAAATGTGAACTCTCCTCATCTGTGCAGACT 
GAC ATC AATTTGC CC TATC TTAC AATGGATTC TTCTGG ACCC AAGCATTTGAATAC AAAGTTG ACCC ATG 
C TC AATTTGAAGGGATTGTC AC TG ATC TAATC AGGAGG ACTATCGC TCC ATGCC AAAAAGC TATGCAAGA 
TGC AG AAGTC AGC AAGAGTG ATATAGG AG AAGTG ATTCTTGTGG AGCC CCC AAGT AAAGC TGTC AATC CT 
GATGAGGCTGTGGCCATTGGAGCTGCCATTCAGGGAGATGTGTTGGCCGGCGATGTCACAGATGTGCTGC 
TCCTTGATGTC ATTCCCCTGTCTCTGGGT ATTGAAAC TC TGCGAGGTGTCTTT AC C AAAC TTATTAATAG 
G AAT ACC ACTATTCC AAC CAAGAAG AGCC AGGT ATTC TC AAC TGCTGC TG ATGGGC AGAC AC AAGTGGAA 
ATTAAAGTGTGTCAGGGTGAGAGAGAGATGGCTGGAGACAACAAACTCCTTGGACAGTTTACTTTGATTG 
G AATTCC AC C AGCCCC TCGTGG AGTTC C TC AGATC AAAGTTAC ATTTG AC ATTGATGCC AATGGGATC AT 
AC ATGTTTC AGC TAAAGATAAAGGC AC AGG AC ATG AGC AGC AGATTGTAATCC AGTCTTCTGGTGGGTTA 
AGCAAAGATGATGTTGAAAATATGGCATCATTGAAGCTCTTCAAAATGGCATACAAAAAGATGGCAACTC 
AACG AG AAGGC TCTGG AAGTTCTGGC AC TGGGG AAC AAAAGGAAGATC AAAAGG AGGAAAAAC AGTAA 

>gi| 7706213 |ref | NM_016284 . 1 | Homo sapiens KIAA1007 protein (KIAA1007) . 
mRNA 

AACAAGGTCCTGGGTAAGTAGTGGGAGTTCTCCTTCAGGATCATGATGTTCGTCAGAGTGAATTTCAGCA 
ACTTCCC TAC C ATCG AATTTTTATCATGCTTCTC TTGG AACTC AATGC ACCTG AGC ATGTGTTGG AAACC 
ATT AATTTCC AG AC ACTT AC AGCTTTCTGCAATAC ATTCC AC ATC TTG AGGC C T ACC AAAGCTCCTGGCT 
TTGTATATGC CTGGC TTGAACTGATTTCC C ATC GGAT ATTT ATTGC AAGAATGCTGGC AC ATACGCC AC A 
GC AG AAGGGGTGGCC TATGT ATGC AC AGC TAC TG ATTGATTTATTC AAAT ATTTAGCGCCTTTCCTT AGA 
AATGTGG AAC TC ACC AAACC TAT GC AAATC CTCTACAAGGGCACTTTAAGAGTGCTGCTGGTTCTTTTGC 
ATG ATTTC C C AG AGTTC C TTTGTGATT ACC ATTATGGGTTC TGTGATGTG ATCCC ACCT AATTGT ATC CA 
GTT AAG AAATTTG ATC CTGAGTGCC TTTCC AAGAAAC ATGAGGC TC CC C GACCCATTC ACTCC TAATC T A 
AAGGTGG AC ATGTTG AGTGAAATTAAC ATTGCTCC CC GGATTCTCACC AATTTC AC TGGAGTAATGCC AC 
CTC AGTTC AAAAAGGATTTGGATTCCT ATC TT AAAAC TC G ATC ACC AGTC ACTTTC C TGTC TG ATCTGC G 
CAGAACC GTAC AGGTATCC AATGAAC C TGGGAATC GC T AC AACC TC C AGCTC ATC AATGC ACTGGTGC TC 
TATGTC GGGACTCAGGCC ATTGC GC AC ATC C AC AAC AAGGGC AGC AC ACCTTC AATGAGCACC ATC ACTC 
ACTCAGCACACATGGATATCTTCCAGAATTTGGCTGTGGACTTGGACACTGAGGGTCGCTATCTCTTTTT 
GAATGCAATTGCAAATCACTCCGGTACCCAAATAGCCACACTCACTACTTCAGTTGCACCATGCTGTACC 
TTTTTGCAGAGGCCAATACGGAAGCCATCCAAGAACAGATCACAAGAGTTCTCTTGGAACGGTTGATTGT 
AAATAGGCC AC ATC C TTGGGGTCTTC TT ATTACCTTC ATTGAGCTGATTAAAAACCC AGCGTTTAAGTTC 
TGG AACC ATG AATTTGTAC ACTGTGC C CC AG AAATC GAAAAGTTATTCC AGTCGGTC GC AC AGTGCTGC A 
TGGGACAGAAGCAGGCCCAGCAAGTAATGGAAGGGACAGGTGCCAGTTAGACGAAACTGCATCTCTGTTG 
TACGTGTCAGTCTAGAGGTCTCACTGCACCGAGTTCATAAACTGACTGAAGAATCCTTTCAGCTCTTCCT 
GAC TTTCC CAGCCCTTTGGTTTGTGGGTATC TGC CCC AAC TAC TGTTGGGATC AGCC TCC TGTC TTATGT 
GGGCACGTTCCAAAGTTTAAATGCATTTTTTTGACTCTTGGCCAAAATTTAGAAGA 

T TTTGAAC TTGTGT AAAT AC ATG AAAG AG AAAAC CTT TGTC TGG AAC TTCTTGGCTTTGTGCAAGC TGTC 
TCC AAGGC AAGTACATAAACTGGTACC TTGTAATG AAG AGGC AGC TGATGC C ATGC AC TTGTCTG AGGGC 
ATAGCTCCATGTCTTCTGACATTCCTGGTCTCCCAAAGAATAGCAAAAAGCCAGTTTGAATATTATGTAA 
C TTATTTTTTTAATGTGG AC AGGGGACCTTGAAAATC ACTAAGTT ATTAAAAATGTGG ATGTGCTAGAAT 
TGGATATGTCCAGGAACATGGGAAGGGCTCACTATTGGAATCCCATGAGTTTCCATTTTGTCTCTACCCA 
AACGTATTC CAAAGCTG AC TGC ATTTGT AC CATC TTATTTC TTTTGGGG ATT AT AC AC CTCAGCCGCCTG 
AGATGGGGGTCAGCTCTTTATATAAAGGGAAACCAGACCAGGCCTAAAGCCCACCCCCTACCCTCACCCC 
C AC AATC CTCTCCTGAAAC TTAAAAACAGTGGGAATATAGGAAAGGG AACC AAATC TC ATTAATTAATTG 
TTCTCCCCC ATT AC CCC AC TG AATGAATGGCC ATAC AGGC TAAGCTGAATAATG AC AAAGTTG AAAGG AC 
C AATAC AGC C CCTTTT AT AAGGATTTTGAATGTTTTGC AAATGT ATTGGTC CCTGTGTTGT ATTT 
CCTTTTCCTGGGCTTCAGCTCCCCTACTTCTTGTATGTGTATGCATACTGTAGCTAACCATTAAAGTCAT 
GAC AC AC AAAAAAAAAAAAAAAAAAAAAAAAAAA 

>gi | 27479970 | ref |XM_034594 . 7 | Homo sapiens KIAA1604 protein (KIAA1604} , 
mRNA 

ATTTC C GTC AGC AC AGGCGCTGGATTTCGGACTTTGTAGTTCATGCCAACTTCTCTTTTTTGTCGCC ATT 
TTGGC AAC AAG AC AGTGTGC CTGG ATC GACTACTTTGAC TCTGGTATTATGATATGGTATTTG AAAT AAA 
TTAAAGGG AT AAC GTGTC AAGC C AATTG AGCGTC AAGTTTTT AAAT ATTTC ACTATTC GC C TTCTTCC T A 
CC GTCGGTGC TCGGG AAG AGGAAGTGAC GTAC GTCC GGTGTAATGGCGGC GCGGTGGAGC TCT AGGTG AT 
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gtttgcagacagaaaaagtaaactgtcaaggaattcagagcattgggtacttgttatttggaccagtttg 
aagcatctttgatttattggaccaactgcx:aacagaaaatgaaaagtagtgtggcacagataaaaccttc 
ttctggtcatgacagaagggaaaaccttaattcatatcagaggaactcctctccagaagacagatatgaa 
gaacaagaacgatccccccgggatagagattactttgattacagcagatcag^^ 
gaggacgttcttatgat agtagc atgg agtc ac g aaac agggaccgagaaaaacgc ag ag aaag ag aaag 
agat acgg atcggaaaaggtctcggaaatcccc atc tcc tggg agg ag aaaccc agaaac atc agtaac t 
cagagttcctctgctcaggatgaacctgctacaaagaaaaagaaagatgagctggatcctcttcttactc 
gc ac tggtggagc atatattcc cc ctgc aaagctc aggatgatgc aggaac ag att ac ag at aaaaac ag 
cttagcataccagaggatgagttgggaggccctgaagaagtcaattaatggccttatcaacaaagtcaac 
atttcc aac ataagtattattattc aagagcttc ttc aagaaaat at agtt ag agg aag aggactgc tgt 

CCAGGTCTGTTTTGCAAGCACAGAGTGCTTCTCCAATCTTCACCCATGTTTATGCAGCATTAGTGGCAAT 
TATCAACTCAAAATTTCCACAAATTGGAGAATTAATCCTCAAAAGGTTAATTCTTAATTTTCGAAAAGGC 
TATCGAAGAAATGACAAGCAACTTTGCCTGACTGCTTCAAAATTTGTGGCGCATCTTATTAACCAAAATG 
TGGCACACGAAGTATTATGCTTAGAGATGCTCACTTTGCTCCTGGAAAGACCAACAGATGATAGCGTTGA 
AGTAGC TATTGGTTTTCTTAAGG AATGTGGCC TC AAATTAAC AC AAGTGTCACCAAGAGG AATC AATGCT 
ATATTTGAAC GCC TTC G AAAC ATTC TGC ATG AGTC TG AAATTGAC AAAAG AGTTC AAT ATATG ATTG AAG 
TGATGTTTGCTGTACGGAAAGATGGATTCAAGGACCACCCCATTATCCTAGAAGGTCTTGATTTGGTGGA 
AGAAGATGATCAATTCACTCATATGCTCCCTCTGGAGGATGACTATAATCCAGAAGATGTTCTTAATG'IT 
TTC AAG ATGG ATC CTAATTTTATGG AGAATG AAG AG AAGT AC AAAGC T ATT AAGAAAG AAATTCTTG ATG 
AGGG AG AT ACTGAC TCGAAC AC AG AC CAGG ATGCTGGGAGT AGTG AAG AGG AC G AGG AAG AAG AAG AGG A . 
AG AG GG AGAAG AAG ATG AAGAAGGAC AAAAAGT AAC T ATTC ATG AC AAAAC AG AAATT AACCTGGTCTC A 
TTTC GTC GTAC AATTTATC TTGCT ATTC AGTC AAGTTT AG ATTTTG AAG AATGTGCTC AC AAATTGCTG A 
AAATGGAGTTTCCTGAAAGCCAAACAAAAGAACTCTGCAACATGATACTTGATTGCTGTGCCCAACAGAG 
GACATACGAAAAATTTTTTGGCTTATTAGCTGGGCGATTTTGCATGCTAAAGAAAGAGTACATGGAATCC 
TTTGAAGGTATATTC AAAG AAC AGTATGAT ACC ATC C ATCGCTTGGAAACAAAC AAGTTGC GAAATGTTG 
CTT^GATGTTTGCTCACCTTTTATACACTGATTCACTTCCATGGAGTGTTCTTGAATGTATAAAACTGAG 
TGAAGAAACC ACT AC ATC ATCC AGT AG AATTTTTGTC AAAAT ATTTTTC C AGGAACTGTGTG AAT AC ATG 
GGTCTTCCTAAACTTAATGCAAGATTAAAGGATGAAACTCTGCAGCCATTCTTTGAAGGATTATTACCCC 
GAG AT AATCC AAG AAAC ACTCGGTTTGC CATC AAC TTC TTTACTTCTAT AGGTCTTGGAGGTTT AAC GG A 
TGAACTGCGGGAGCATCTCAAAAATACACCAAAGGTCATTGTGGCGCAGAAACCAGATGTTGAGC AAAAT 
AAATCCTCCCCATCCTCTTCCTCTTCAGCGTCCTCCTCTTCAGAGTCTGACTCATCCGACTCTGATTCTG 
ACAGCAGTGATAGCAGTTC^GAGTCTTCCAGTGAAGAGA(X:GACTCTTCATCCATCAGTAGTCATAGCTC 

TGCC TC AGC T AATGATGT AAG AAAGAAG GG AC ATGGG AAG AC C AGAAGTAAAGAGGTAGATAAATTGATC 
AGAAACCAGCAAACAAATGATAGGAAACAAAAAGAAAGAAGACAAGAACACGGGCACCAGGAAACAAGGA 
CTGAGAGAGAAAGAAGGTCAGAAAAACACAGAGATCAAAATTCAAGTGGTTCAAATTGGAGAGATCCTAT 
AAC AAAGTAC AC ATC AG AC AAAG ATGTTCCTTC TGAACGAAAT AACTAC AGT AGAGTTGC G AATGAC AGA 
G ACC AAG AAATGC ATAT AG ATTTGGAAAATAAGC ATGGTGATC CC AAAAAG AAGAG AGGAG AG AG AAG AA 
ATTCTTTTTCTGAAAATGAGAAGCATACACACCGAATTTUU^GACAGTGAAAATTTCAGAAGAAAAGATAG 
ATCAAAGTCAAAGGAAATGAATAGAAAGCACTCAGGCTCAAGAAGTGATGAAGATAGATATCAAAATGGT 
GCCG AG AGACG ATGGG AAAAATCTAGC AGAT ACTCTG AAC AATC CAGAGAATCAAAGAAAAATC AGG AC C 
GGCG AAG AG AAAAGTCTCC AGC AAAAC AAAAATAATTCTACAAAATGAC AT AAAC TGG AC ATGC CTT ATA 
TTCAGTTGAAACAAATTATTTTTTACTTTGACGAACTTTATAGAGAATTCTTGTATAAAGTATTGGTTTG 
TATTTGT AAATTTAAAAAATGTTTTC ATTTT AAC ATGTTTT AT AATTG ATAC ATCTC AAGGC 
TAAAATTATTTGAAAACTTTCACCAGAAAGGGATGTAAGTCTCGCTTATATTTTGTGAATAAAATGATCA 

AAAGCTC 

>gi | 14042952 | ref | NH_032031 . 1 | Homo sapiens FKSG17 (FKSG17 ) , mRNA 

AG ACTTTCCCCCCTTCTTTAC AGC ATTC TATC AAAAT AC AGTGTTTTGGGCGTCCGGCTTCCACGTTCCC 

TCCACAAAATGCCTGGCGAAGCCACAGAAACCGTCCCTGCTATAGAGCAGCAGTTGCTGCAGCCCCAGGC 

TGAG AC AGGGTCTGG AAC AG AATCTG AC AGTGATGAGTC AGT ACC AG AGC TTGAAG AAC AGG ATTCC ACC 

CAGGTAACCGCACAAGTCCAGTTGGTGGTAGCAGCTGAAATTGATGAAGAACCAGTCAGTAAAGC AAAAC 

AG AGGCGG AGTGAAAAGAAGGC AC GGAAGGC TAGGTTC AAAC TGGGTCTTC AAC AGGTTAC AGG AGTT AC 

TAG AGTCACC ATC CGG AAATC T AAG AAT ATCCTCTTTGTC ATC AC AAAAC C AG ATGTC TAC AAG AGCC C G 

GC TTC GG AT AC CT AC ATGG TTTTTGGGG AAGCC AAGATTG AAG ATTT ATC TC AGG AAGC AC AACTAGC AG 

CTGCTGAGAAATTCAAAGTTCAAGGTGAAGCTGTCTCAAACATTCAAGAAAACACACAGACTCCAACTGT 

AC AAG AGGGC AGTG AAG AC GAAGAGGTC G ATG AAAC AGGTGTAGAAATTAAGG AT ATAG AATTGGTCC TG 

TCACAAGCAAATGTGTGGGGAGCAAAGGCAGTCCGAGCCCTGAAGAACACTAATGATATTGTAAATGCTA 
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TTATGGAATTAACAATGTAACCATCTGAAAGCAACTTTTTTGGTGTCTCAGATGAGTAACTGCAGCT^ 

TTTGAAATTTGTACTGTTTCTATCATAAATAAAGTTATGGCTTCTTGTTGGATGAAAAAAAAAAGACAGT 

ATTTTGCCAAATTAAACAGAAAAGAGAGTTCAGTGGAAGATGGCTCTGATTTC 

AGGTAACAATTAAATGGCATTTAAAAATTTGCTGAATGCTTGCTACGAGCGGTGGCTCATGCCTGAATCT 
CAGCACTTTGGGAGGCCAAGGCAGGCAGATCATCTGAGGTCATGAGTTCGAGACCAGCCTGGCCAACATG 
ATGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGCACACCTGTAATCCCAGTT 
ACTCAGGAGGCTGAGGCAGGAGAATCACTTGAATCCAGAAGGCGGAGGTTGCAGTGAGCTGAGATTGCAC 
CACGGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTCTCAAA 

>gi | 20541583 | ref |XM_166571 . 1 | Homo sapiens KIAA0363 protein (KIAA0363), 
mRNA 

GGGAGCCCTGTGCTCTGACCCCAGGGCCCAGCCACCTGGCCCTCACGTTCCTGCCCAGCAAGCCGGGTGC 
CCGGCCCCAGCCCGAGGGAGCCAGCTGGGATGCAGGGCCTGGTGGGGCACCCTCGGCCTGGGCGGACCCA 
GGGGAGGGCGGCCCAAGCCCCATGCTCCTCCCTGAGGGCCTGTCTTCCCAGGCTCTGTCCACGGAGGCTC 
CTCTCCCGGCCACCCTGGAGCCCCGGATTGTGATGGGAGAGGAGACGTGCCAGGCCCTCCTGTCACCCAG 
AGCTGCCCGGACAGCGCTCAGGGACCAGGAGGGTGGGCACGCAAGCCCAGACCCACCCCCCGAGCTGTGT 
TCTCAGGGTGATCTTTCTGTGCCTTCCCCTCCCCCAGACCCTGATTCCTTCTTCACGCCTCCCTCCACCC 
CCACCAAGACCACCTATGCCCTGCTTCCTGCCTGTGGGCCCCACGGGGACGCCAGGGACTCAGAGGCTGA 
GCTGCGGGATGAGCTGCTGGACTCGCCCCCCGCCTCACCCTCGGGCTCCTACATTACGGCCGATGGGGAC 
AGCTGGGCCTCTTCACCCTCCTGTTCCCTCAGCCTGCTGGCTCCGGCTGAAGGGCTGGACTTCCCCTCAG 
GCTGGGGCCTGTCCCCGCAGGGGTCCATGGTGGATGAACGAGAGCTGCACCCAGCAGGGACCCCAGAGCC 
CCCGTCCTCTGAGTCCAGCCTCTCTGCAGACAGCAGCTCCTCCTGGGGCCAGGAGGGCCACTTCTTCGAC 
CTGGACTTCCTGGCCAATGACCCAATGATCCCCGCAGCCCTCCTACCCTTCCAGGGCAGCCTCATCTTTC 
AGGTGG AGGC AGTGG AGGTGAC AC CGC TATC CCC AG AGGAAG AAGAAGAGGAGGCTGTGGCGGATC CCGA 
CCCAGGTGGGGACCTGGCTGGGGAGGGTGAGGAGGACAGCACGTCTGCCTCCTTCCTGCAGTCACTGTCT 
GACCTGTCCATCACGGAGGGCATGGACGAGGCTTTTGCCTTCCGGGACGACACCTCTGCAGCCTCCTCTG 
ATTC AG ACTC AGCC TCC TACGC AG AGGCAGATGATG AGAGGC TGT AC AGC GGGGAGCC CC ATGC CC AGGC 
C AC TTTGCTCC AGG AC AGTGTCC AGAAGAC AGAGGAGG AG AGCGG AGGTGGGGCC AAGGGGCTGC AGGC T 
CAGGATGGGACTCTGTCCTGGGCCGTGGAGGCTGCTCCTCAGACCTCAGACAGAGGGGCCTATCTGTCCC 

AG AGAC AGG AATTG ATC TC AG AAGTAAC AGAAG AGGGCCTTGC TTT AGGC C AGG AGTCC ACTGC CAC TGT 

GACCCCTCACACTCTGCAGGTAGCCCCAGGCCTCCAGGTGGAGGTGGCTACCAGAGTGACCCCACAGGCT 

GGGGAGGAAGAAACAGACTCCACCGCTGGACAAGAATCTGCTGCCATGGCAATGCCTCAGCCCTCCCAGG 

AGGGCATCAGCGAGATCTTAGGCCAAGAGTCTGTCACTGCAGAAAAACTTCCAACTCCACAGGAAGAAAC 

AAGCCTCACATTGTGTCCAGACTCTCCTCAGAACTTGAAGGAAGAAGGAGGGCTGGACCTCCCCTCTGGC 

AGAAAGCC TGT AGCTGC AGCC ACGATTGTCCCC AGGC AGGC TAAAG AGG AC CTCACCTTACCCC AGG ACT 

CCGCTATGACACCGCCTCTGCCCCTACAAGACACAGATCTCTCATCAGCCCCAAAGCCTGTGGCTGCAGC 

C AC G ATTGTGTCCC AGC AGGC TGAAGAGGGC CTCACCTTACCCC AGG AC TCCGTTATG AC AC CGCCTCTG 

CCCCTACAAGACACAGAACTCTCGTCAGCCCCAAAGCCTGTGGCTGCAGCCACGCTTGTGTCCCAGCAGG 

CTGAAGAGGGCCTCACCTTACCCCAGGACTCCGCTATGACACCGCCTCTGCCCCTACAAGACACAGATCT 

CTCGTCAGCCCCAAAGCCTGTGGCTGCAGCCACGCTTGTGTCCCAGCAGGCTGAAGAGGGCCTCACCTTA 

CCCCAGGACTCCGCTATGACACCGCCTCTGCCCCTACAAGACACAGATCTCTCGTCAGCCCCAAAGCCTG 

TGGCTGCAGCCACGCTTGTGTCCCAGCAGGCTGAAGAGGGCCTCACCTTACCCCAGGACTCCGCTATGAC 

ACCGCC TCTGC CCC TAC AAG AC AC AGATC TCTCGTC AGCC CCAAAGC CTGTGGCTGC AGC CAC GATTGTG 

TCCCAGCAGGCTGAAGAGGGCCTCACCTTACCCCAGGACTCCGCTATGACACCGCCTCTGCCCCTACAAG 

ACACAGATCTCTCGTCAGCCCCAAAGCCTGTGGCTGCAGCCACGATTGTGTCCCAGCAGGCTGAAGAGGG 

CCTCACCTTACCCCAGGACTCCGCTATGACACCGCCTCTGCCCCTACAAGACACAGATCTCTCGTCAGCC 

CCAAAGCCTGTGGCTGCAGCCACGCCTGTGTCCCAGCAGGCTGAAGAGGGCCTCACCTTACCCCAGGACT 

CCGCTATGACACCGCCTCTGCCCCTACAAGACACAGATCTCTCGTCAGCCCCAAAGCCTGTGGCTGCAGC 

CACGCCTGTGTCCCAGCAGGCTGAAGAGGGCCTCACCTTACCCCAGGACTCCGCTATGACAGCACCTCTG 

CCTCTGCAAGACACAGGCCCCACCTCAGGTCCAGAGCCTCTGGCTGTGGCCACCCCTCAAACCTTGCAGG 

C AGAAGC AGGC TGTGC CC C AGGG AC AG AGCCTGTGGC C ACC ATGGC TCAGC AGGAAGTAQGTG AGGC C TT 

AGGCCCCAGGCCAGCACCTGAGGAGAAGAATGCAGCCCTCCCTACAGTCCCGGAGCCTGCAGCCTTGGAC 

CAGGTCCAACAGGATGACCCACAGCCAGCTGCAGAAGCTGGGACACCTTGGGCCGCACAGGAAGATGCGG 

ATTCCACTTTGGGCATGGAGGCCCTCAGTCTCCCTGAGCCGGCCTCTGGTGCTGGGGAGGAAATAGCAGA 

AGCCCTTTCTAGGCCTGGACGGGAAGCATGTCTGGAAGCGCGAGCGCACACAGGTGATGGGGCTAAGCCT 

G ACTC ACC CCAAAAGGAGACCCTGGAGGTTGAGAACC AGC AGG AAGG AGGCC TGAAGC C ACTGGC AC AGG 
AACATGGACCCAGGTCAGCACTTGGAGGTGCAAGGGAGGTCCCCGATGCGCCTCCTGCTGCCTGCCCTGA 
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GGTCAGCCAGGCCCGGCTCCTGAGCCCAGCCAGGGAGGAAAGAGGCCTGAGTGGCAAGTCCACCCCGGAG 
CCCACGCTTCCCTCAGCTGTGGCCACAGAGGCCAGTCTGGACTCCTGCCCAGAGTCTTCAGTAGGGGCTG 
TGTCCAGTCTGGACAGAGGCTGCCCTGACGCGCCTGCCCCCACGTCTGCACCAACCTCCCAGCAGCCGGA 
GCCTGTACTGGGTCTGGGCAGTGTTGAGCAACCCCACGAAGTACCCAGTGTCCTTGGCACCCCCTTGCTG 
CAGCCCCCAGAAAACCTTGCCAAGGGTCAGCCCAGCACGCCCGTGGACAGGCCCCTGGGCCCTGACCCTT 
CTGCTCCTGGTACCCTTGCTGGGGCAGCCCTACCCCCACTGGAGCCCCCAGCCCCCTGCCTGTGCCAGGA 
CCCCCAGGAAGACTCTGTGGAAGACGAGGAGCCCCCAGGCTCTCTGGGCCTCCCACCGCCCCAGGCAGGA 
GTCCAGCCTGCCGCTGCTGCTGTCTCAGGAACCACACAGCCTCTGGGGACTGGGCCGCGAGTCAGCCTCT 
CGCCTCACTCCCCACTCCTCAGCCCCAAGGTGGCCTCCATGGATGCCAAAGACCTGGCCTTGCAGATCTT 
GCCGCCTTGCCAAGTGCCTCCTCCCTCTGGGCCCCAGAGCCCAGCTGGCCCTCAAGGGCTCTCAGCCCCC 
GAGCAGCAAGAGGATGAGGACAGCCTGGAGGAAGACTCGCCTCGGGCCCTGGGCTCGGGCCAGCATTCGG 
ATAGCCACGGGGAGTCATCAGCCGAGCTGGACGAGCAGGACATCTTGGCTCCTCAGACCGTGCAGTGTCC 
AGCC C AGGC CCC AGC AGGCGGC AGTGAGGAG ACC ATCGC C AAAGC C AAGC AG AGTCGC AGTGAG AAG AAG 
GCCCGAAAGGCAATGTCAAAGCTGGGCTTGCGGCAGATTCAGGGAGTCACCAGGATCACCATCCAGAAGT 
CCAAGAACATCCTCTTTGTCATCGCCAAGCCTGATGTCTTCAAGAGCCCAGCCTCAGACACTTATGTGGT 
CTTTGGCGAGGCCAAGATTGAGGACCTGTCCCAGCAAGTGCACAAAGCCGCAGCTGAGAAGTTTAAGGTG 
CCCTCAGAGCCCTCAGCCTTGGTCCCTGAGTCAGCACCCAGGCCCCGGGTGAGGCTGGAGTGCAAGGAAG 
AGGAAGAGGAGGAGGAGGAAGAGGTGGACGAGGCGGGGCTGGAACTGCGTGACATTGAGCTGGTGATGGC 
GCAGGCCAATGTGTCCAGGGCCAAGGCCGTGCGGGCTCTGAGAGACAACCACAGTGACATCGTCAACGCC 
ATCATGGAACTGACCATGTAGCCACTGACCGGAAGCTGGAGCCATCCTACGCCTTCCCTCAGCTCTGCTA 

CTC AAT AAATC GGTGTCC CTTC 

>gi [27481387 |ref|XM_087502 .6| Homo sapiens similar to alpha NAC/1.9.2. 
protein (LOC152682) , mRNA 

CTTGGTTCCTCGTTCCCTGCACAAAATGCCCTGTGAAGCCACAGAAACCGTCCCTGCTACAGAGCAGGAG 
TTGCCGCAGCCGCAGGCTGAGACAGGGTCTGGAACAGAATCTGACAGTGATGAATCAGTACCAGAGCCTG 
AAGAAC AGGATTC CAC CC AG ACGTCC AC AC AAG AAGCCC AGCTGGTGGC AGC AGC TG AAATTG ATG AAG A 
GC C AGTC AGTAAAGC AAAAC AGATTTGG AGTG AAAAGAAGGC AC AG AAGGC TATGTCC AAACTGGGTCTT 
CGACAGGTTACAGGAGTCACTAGAGTCGCTATCCGGAAATCTAAGAATATCCTTTTTGTCATCACAAAAC 
C AAATGTC TGC AAG AGC CC TGCTTTGGAT AC C TAC ATAGTTTC TGGGGAAGCC AAGATCGAAGATTTATC 
TGAGC AAGC AC AAC T AGCAGCTGCTGAGAAATTC AATGTTC AAGGTGAAGCTGTC TC AAAC ATTC AAGAA 
AACACACAAACTCCAACTGTACAAAAGGAGAGTGAAGAAGAAGAGGTCGATGAAACAGGTGTAGAAGTTA 
AGG AC ATAG AATTGGTC ATGTC AC AAGC AAATGTGTGG AGAGC AAAGGC AGTC C ATGCC CTG AAG AAC AA 

CTC AAAGGAGT AAC TGC AGC TTGGTTTG AAATTTGTACTGTTTC TATC AT AAATAAAGTTATGGC TTC TT 
GTTGGA 

>gi | 5031930 |ref|NM_005594.1| Homo sapiens nascent - polypeptide - 
associated complex alpha polypeptide (NACA) , mRNA 

CTTGGTTCCGCGTTCCCTGCACAAAATGCCCGGCGAAGCCACAGAAACCGTCCCTGCTACAGAGCAGGAG 
TTGC CGC AGC CCC AGGCTG AGAC AGGGTCTGGAAC AGAATCTG AC AGTGATG AATC AGTACCAG AGCTTG 
AAGAAC AGGATTC CACCC AGGC AAC CAC AC AAC AAGCCC AGC TGGCGGC AGC AGCTGAAATTG ATGAAG A 
ACC AGTC AGTAAAGC AAAAC AGAGTCGGAGTGAAAAG AAGGC ACGG AAGGC T ATGTC C AAACTGGGTCTT 
CGGCAGGTTACAGGAGTTACTAGAGTCACTATCCGGAAATCTAAGAATATACTCTTTGTCATCACAAAAC 
CAGATGTCTACAAGAGCCCTGCTTCAGATACTTAC^TAGTTTTTGGGGAAGCCAAGATCGAAGATTTATC 
CCAGCAAGCACAACTAGCAGCTGCTGAGAAATTCAAAGTTCAAGGTGAAGCTGTCTCAAACATTC AAGAA 
AAC AC AC AGAC TCC AAC TGT AC AAG AGGAGAGTG AAG AGG AAGAGGTC GATG AAAC AGGTGT AG AAGTT A 
AGGACATTGAATTGGTCATGTCACAAGCAAATGTGTCGAGAGCAAAGGCAGTCCGAGCCCTGAAGAACAA 
CAGTAATGATATTGTAAATGCGATTATGGAATTAACAATGTAACCATATGGAAGCAACTTTTTTTGGTGT 
CTC AAAGGAGT AAC TGC AGCTTGGTTTGAAATTTGT AC TGTTTC TATCAT AAATAAAGTTATGGC TTCTT 
GTTGGAAAAAAAAAAAAAAAAAAAAAA 

>gi | 4758733 |ref|NM_004279.1| Homo sapiens peptidase (mitochondrial 

processing) beta (PMPCB) , mRNA 

GTTCGTAGCAGAAATGGCGGCTGCGGCGGCTCGAGTGGTGTTGTCATCCGCGGCGCGAGGCGGCCTCTGG 

GGTTTCAGCGAGAGTCTTCTAATCCGAGGCGCTGCGGGACGGTCATTATATTTTGGAGAGAACAGATTAA 
G AAGTAC AC AGGC TGC TACCC AAGTTGTTC TGAATGTTC C TG AAAC AAG AGTAACATGTTTAGAAAGTGG 
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ACTCAGAGTAGCTTCGGAAGACTCTGGGCTCTCAACATGCACAGTTGGACTCTGGATTGATGCTGGAAGT 
AGATACGAAAATGAGAAGAACAATGGAACACCACACTTTCTGGAGCATATGGCTTTCAAGGGCACCAAGA 
AG AGATCC C AGTTAG ATC TGGAACTTGAG ATTGAAAATATGGGTGC TC ATC TC AATGCCT ATACC TCC AG 
AGAGC AGACTGTATAC T ATGCC AAAGC ATTCTCT AAAGACTTGC C AAGAGC TGT AGAAATTC TTGC TG AT 
ATAATACAAAACAGCACATTGGGAGAAGCAGAGATTGAACGTGAGCGTGGAGTAATCCTTAGAGAGATGC 
AGGAAGTTG AAACCAATTTAC AAGAAGTTGTTTTTGATTATCTTCATGCC AC AGCTTATC AAAATAC TGC 
ACTTGGACGGACAATTTTGGGACCAACTGAAAATATCAAATCTATAAGTCGTAAGGACTTAGTGGATTAT 
ATAACC AC AC ATT ATAAGGGGCC AAG AATAGTGCTTGC TGCTGC TGGAGGTGTTTCC C ATGATGAATTGC 
TTGACTT AGCAAAGTTTC ATTTC GGTGACTC TTTATGC AC AC AC AAAGG AGAAAT AC C AGCTCTGCCTCC 
CTGC AAATTC AC AGGAAGTGAGATTCGTGTGAGGG ATGAC AAG ATGCC TTTGGCGCAC CTTGC AATAGCT 
GTTG AAGCTGTTGGTTGGGC AC ATCC AG ATAC AATC TGTC TC ATGGTTGC AAAC ACGCTG ATTGGC AACT 
GGGATCGCTC TTTTGGGGGAGGAATGAATTTATCTAGC AAGCTGGC CC AGCTC AC TTGTC ATGGC AATCT 
TTGCCATAGCTTTCAGTCTTTCAACACTTCCTACACAGATACAGGATTATGGGGACTGTATATGGTTTGT 
GAATCATCCACTGTTGCAGACATGCTACATGTTGTTCAAAAAGAATGGATGCGACTCTGTACAAGTGTCA 
CAGAAAGTGAGGTTGCACGAGCCAGAAATCTTCTGAAAACAAACATGTTGTTGCAGCTTGATGGTTCAAC 
TCCAATTTGTGAAGATATTGGTAGGCAAATGTTATGCTATAATAGAAGGATTCCCATCCCTGAGCTTGAA 
GCAAGAATTGATGCTGTGAATGCTGAGACAATTCGAGAAGTATGTACCAAATACATTTATAATAGGAGTC 
C AGCT ATTGCTGC TGTTGGTCCC ATTAAGC AAC TACC AG ATTTTAAAC AGATAC GC AGT AAC ATGTGTTG 
GCTTCGTGATTAAAATGCTCCTAATCAAGATTGTTTGAACACATGTATTTATAAAACAGAGCTAGAGAAA 
AATAAAAATGAACATGTATATACATTTGGAAATTTGAATTAAATACTGTATCATACTTTCAAAGGATAAA 
AAG ACT ACCCC TC TG AAGGTTGTTTTGTATTAATGGTC AGTC TTTGTTCTCTGAGAAATT ATGTTGG AAG 
CAGCATACT'rTCAAATTATTACCATGAGTATAATTTTAAGAATGAAAATGTTTACAGTATTTTCAGTTTT 

ATTATAAAAATGCACACACAA 

>gi|4507840|ref | NM_003365 . 1 | Homo sapiens ubiquinol-cytochrome c 
reductase core protein I (UQCRCD , mRNA 

TACTGACTTTCAGCACCAACTTGTGGTCCCAGGTAAGTTTCCACGCTGGTACTTTAGCCCTGGGCTCGAA 

CCTGCGGACACGCTGTGGCTGCAACTCCCCGCCCGCCAGACCTCAGTACGCAGCGCGGCTGGTGAGAAAC 

ATAATGCACTCTGACTTCCACGCGTGGAATGGGGAGATGGACTGCACGGCGGGCGGACCTGCTCGGGCTG 

ATGGACGGCAGGTGGACTGATGTGCGCAGGGACTGGCGGCAGCGCGGTCAGAGCCAGTCAGCCAAAGCCA 

GGCCAGCACAATAGACTGTCCCGGTTCCCGCCAGGAGGCGGCCGAGCACCAACTGTACGGTACTGCGCCT 

GCGCCGCGACCGCCAACGCGCCCAGTCTACGCTTGCGCGGCGCAACAGGGCCGACTGCAGCTGGAAGATG 

GCGGCGTCCGTGGTCTGTCGGGCCGCTACCGCCGGGGCACAAGTGCTATTGCGCGCCCGCCGCTCGCCGG 

CCCTGCTGCGGACGCCAGCCTTGCGGAGTACGGCAACCTTCGCTCAGGCGCTCCAGTTCGTGCCGGAGAC 

GCAGGTTAGCCTGCTGGACAACGGCCTGCGTGTGGCCTCCGAGCAGTCCTCTCAGCCCACTTGCACGGTG 

GGAGTGTGGATTG ATGTTGGC AGCCGTTTTGAG AC TGAG AAG AAT AATGGGGC AGGC TAC TTTTTGG AGC 

ATCTGGCTTTCAAGGGAACAAAGAATCGGCCTGGCAGTGCCCTGGAGAAGGAGGTGGAGAGCATGGGGGC 

C C ATCTT AATGCCT AC AGC ACCC GGGAGC AC AC AGCTTAC TAC ATC AAGGCGCTGTC CAAGG ATC TGCC G 

AAAGCTGTGGAGCTCC TGGGTGACATTGTGC AGAACTGT AGTC TGGAAG ACTC AC AG ATTGAG AAGGAAC 

GTGATGTGATCCTGCGGGAGATGCAGGAGIAATGATGCATCTATGCGAGATGTGGTCTTTAACTACCTGCA 

TGCC AC AGC ATTCC AGGGC AC ACC TC TAGCCC AGGCTGTGG AGGGGC CC AGTG AG AATGTC AGGAAGCTG 

TC TC GTGCAG AC TTGAC CG AGTAC CTCAGC ACACATTAC AAGGCCCC TCGAATGGTGCTGGC AGC AGCTG 

GAGGAGTGGAGCACCAGCAACTGTTAGACCTCGCCCAGAAGCACCTCGGTGGCATCCCATGGACATATGC 

AG AGG AC GC TGTGC CC ACTC TT AC TC CATGCCGC TTCACTGGC AGTGAGATCC GC C ACCGTG ATG ATGCT 

CTACCTTTTGCCCACGTGGCCATTGCAGTAGAGGGTCCTGGCTGGGCCAGCCCGGACAGTGTGGCCTTGC 

AAGTGGCCAATGCCATCATCGGCCACTATGACTGCACTTATGGTGGTGGCGTGCACCTGTCCAGCCCACT 

GGCTTCAGGTGCTGTGGCCAACAAGCTATGCCAGAGTTTCCAGACCTTCAGCATCTGCTATGCAGAGACG 

GGCTTGCTGGGTGCACACTTTGTCTGTGACCGAATGAAAATCGATGACATGATGTTCGTCCTGCAAGGGC 

AGTGGATGCGCCTGTGTACCAGTGCCACGGAGAGTGAGGTGGCCCGGGGCAAAAACATCCTCAGAAATGC 

CCTGGTATCTCATCT AG ATGGC ACTACTCCTGTGTGTGAGGACATCGGACGCAGCCTCCTGACCTATGGC 

CGCCGCATCCCCCTGGCTGAATGGGAAAGCCGGATTGCGGAGGTGGATGCCAGTGTGGTACGTGAGATCT 

GCTCCAAGTACATCTATGACCAGTGCCCAGCAGTGGCTGGATATGGCCCCATTGAGCAGCTCCCAGACTA 

CAACCGGATCCGTAGCGGCATGTTCTGGCTGCGCTTCTAGGCGGGAAGCCTATGTAAGCAAGAGGGCAGG 

GCCGGGGTTTGTGGTCCCCCCCCCACCACAAACACAGCACTTCGGCTCCTCTAACCTGTGCCACAGGTGA 

CCACCAATAAAATCCTCTGCTGAGA 
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>gi | 26051277 |ref|NM_172020.1 | Homo sapiens POM121 membrane glycoprotein 
(rat) (P0M121) , mRNA 

CGCTTCCTGCGCCTCTTCAGGTCACCGCTTGCTCTAGTTCCCAGGCTTTGGCCTCTAGTGGATGAGAATC 
ACCGAGTCTGCGGGGCTGGACGCTGACCGCCCGGGCCAGCACCTAGGCGGGCGGGAGCTGTGCGGCCCAG 
GGTTCGCGCGGGCCGGGTAGAGGCTCGAGCCGGGACCCCCGAGCGTGAACCCCGGAGCCAGCGGCGCTGG 
GGCCAGAGGGGCCAGGCGGGACGTGGTGGCGGAGGCGAAGGGGCGACGGGACCTGGGCCTGGCCCGTGTG 
TGTCCTCGGCGGCCTGGCGCCGGCCGTCGCTGTACGGTGAGCCCCAGGGAGGCGGATCTGGGCCCCGAGA 
AGGACACCCGCCTGGATTTGCCCCGTAGGCCCGGCCCGGGCCCCTCGGGAGCAGAACAGCCTTGGTGAGG 
TGGACAGGAGGGGACTTCGCGAGCAGACGCGCGCGCCAGCGACAGCAGCCCGCCCCGGCCTCTCGGGAGC 
CGTGGGGCAGAGGCTGCAGAGCCCCAGGAGGGGGCCAGTGTCATTCAAAGATGTGGCTGTGGATTTCACC 
C AGGAGG AGTGGTGGC AAC TGGAC C C TG ATG AG AAGATAAC ATACGGGGATGTGATGTTGGAGAAC TAC A 
GCCATCTAGTTTCCCTGGCTTATGAGGTGGCAACATCTTGTACTTCGGAGATCTGAAGCCGAGCAACTTG 
CCCAAGTCCTTCTTCTTTTCCCATTAACAAGATATGATATCACCAAGCCAAACGTCATCATTAAGTTGGA 
GC AGGG AG AGGAGCTGTGG ATAAC GGG AGGTG AATTTCC ATGTC AACATAGTCC AGGG ATTGTGGG AC TT 
TACC AAATCGGTTTGT AATAAC ACC TAGAAGACGCTATCCG ATCC ATC AGGCC C AGTATTCCTGTC TGGG 

ggtacttcccaccgtgtgctggaatggttatcacaagaaggctgtgctgtcccctcgcaactccaggatg 
gtgtgt agccc agtg actgtgaggatcgc c cc tc c tg ac ag aag attttcgcgttc tgc gat acc agagc 
agataatcagci<:aacactgtcctcaccatc^gtaacgccccagacccatgtgcaaaggagacagtact 
g agtgc cc tc aaagag aaggag aag aaaaggacagtggaggaag aag ac c aaatattccttgatggc c ag 

GAAAATAAAAGAAGGCGCCATGATAGCAGTGGCAGTGGACATTCAGCATTTGAGCCCCTGGTGGCCAATG 
GAGTCCCCGCTTCTTTTGTGCCTAAGCCTGGGTCTCTGAAGAGAGGCCTCAATTCTCAGAGCTCAGATGA 
CCACTTGAATAAGAGATCCCGAAGCTCTTCCATGAGCTCCTTGACAGGCGCTTACGCAAGTGGCATCCCT 

AGCTCC AGC CGC AATGCC ATTAC C AGTTCC TAC AGC TC C ACTC GAGGC ATCTC AC AGCTCTGG AAGAGAA 
ATGGCCCCAGTTCATCACCCTTCTCTAGCCCAGCCTCCTCCCGCTCCCAGACACCGGAGAGGCCAGCAAA 

GAAAATAAG AGAAGAGG AGCTGTGTC ATC AT TC C AGTTCTTC AACTC C ATTGGC AGC AGAC AGGGAGTCC 
C AGGG AGAAAAGGCTGC AG ATAC AACCC C AAGG AAG AAAC AAAAC TC G AATTC TC AGTCTAC ACC TGGCA 
GCTCTGGGCAGCGTAAGCGGAAAGTTCAGCTGCTGCCTTCTCGGCGAGGGGAACAGCTGACCTTGCCTCC 
ACCTCCCCAGCTTGGCTATTCGATCACTGCCGAGGACCTAGACTTAGAGAAGAAGGCTTCATTACAGTGG 
TTC AACC AGGC C TTGGAGGAC AAG AGCGATGCTGCCTC G AACTC TGTC ACTGAGACCC CACCTATC AC TC 
AGCCTTCATTTACCTTTACCCTGCCTGCTGCTGCACCTGCCTCCCCACCCACCTCCCTCCTGGCCCCAAG 
C AC C AAC C C AC TGTTAG AG AGC TTGAAGAAGATGC AG AC TC CCC CG AGC CTGC C ACCCTGC C C AGAATCT 

cx:tggagcagcaaccactgaggccctctcacctccaaagacacccagcctcctacccccgctgggtttat 

CACAGTCAGGGCCGCCAGGGCTGCTCCCCAGCCCCTCCTTTGACTCCAAACCCCCGACCACTTTGCTGGG 
GCTGATCCCTGCTCCATCCATGGTACCAGCCACTGACACCAAGGCACCTCCAACCCTTCAGGCAGAGACG 
GCTACCAAACCCCAAGCCACATCTGCCCCGTCCCCCGCCCCCAAGCAAAGCTTCCTGTTTGGAACACAGA 
ACACCTCACCTTCCAGCCCTGCCGCCCCTGCTGCATCTTCAGCACCTCCCATGTTCAAGCCCATTTTCAC 
GGCTCCACCCAAGAGTGAGAAGGAAGGCCCCACACCGCCTGGCCCTTCAGTCACAGCCACAGCGCCCTCC 
AGCTCCTCCCTCCCCACGACCACCAGCACCACAGCCCCGACCTTCCAGCCTGTCTTTAGCAGCATGGGGC 
CACCTGCATCTGTGCCCTTGCCTGCTCCCTTCTTCAAGCAGACAACTACTCCCGCCACTGCTCCCACCAC 
AACTGCCCCGCTCTTCACTGGCCTGGCCAGCGCCACCTCTGCTGTGGCTCCCATCACCTCTGCCAGTCCA 
TCCACAGACTCTGCTTCGAAGCCTGCGTTTGGCTTTGGCATAAACAGTGTGAGCAGCAGCAGTGTGAGTA 
CCACGACCAGCACCGCCACTCCCGCCTCACAGCCTTTCCTCTTCGGGGCGCCCCAGGCCTCTGCTGCCAG 
CTTCACCCCGGCCATGGGCTCCATATTCCAGTTTGGCAAACCTCCTGCCTTGCCCACAACCACCACAGTC 
ACCACCTTCAGCCAGTCCCTGCACACTGCCGTGCCAACGGCCACCAGCAGCAGCGCTGCCGACTTTAGTG 
GTTTTGGCAGCACCCTCGCCACCTCCGCCCCGGCCACCAGCAGCCAGCCCACTCTGACGTTCAGTAACAC 
GAGCACCCCCACGTTCAACATTCCCTTTGGCTCAAGCGCCAAGTCCCCGCTCCCATCATATCCGGGAGCC 
AACCCCCAGCCCGCATTTGGGGCCGCTGAGGGGCAGCCACCGGGGGCCGCCAAGCCGGCCCTTGCCCCCA 
GCTTTGGCAGCTCTTTCACTTTTGGAAACTCTGCAGCCCCGGCTGCTGCACCCACACCTGCACCTCCGTC 
CAT<^TCAAGGTCGTGCCTGCGTACGTGCCTACGCCCATCCATCCTATCTTTGGCGGTGCCACGCACTCG 
GCGTTTGGGTTGAAAGCCACGGCTTCGGCCTTCGGCGCTCCCGCCAGCTCACAGCCCGCCTTTGGCGGCT 
CCACTGCTGTCTTCTTCGGTGCAGCCACCAGCTCCGGCTTTGGAGCCACCACCCAGACCGCCAGCAGCGG 
GAGCAGCAGCTCGGTGTTTGGCAGCACAACACCATCACCCTTCACGTTTGGGGGTTCGGCAGCCCCCGCT 
GGCAGTGGGAGCTTTGGGATCAATGTGGCCACCCCAGGCTCCAGCACCACCACCGGAGCTTTCAGCTTTG 
GAGCAGGACAGAGTGGGAGCACAGCC^CCTCCACCCCCTTCGCAGGGGGCTTAGGTCAGAACGCCCTGGG 
CACCACCGGCCAGAGCACACCGTTTGCCTTCAACGTGAGCAGCACAACTCAGAGCAAACCTGTGTTTGGA 
GGCACCGCCACCCCCACCTTTGGTCTGAACACCCCTGCGCCTGGAGTGGGCACATCAGGCAGCAGCCTCT 
CCTTTGGGGCATCCTCAGCACCCGCCCAAGGCTTTGTTGGTGTTGCACCTTTCGGATCGGCGGCCCTTTC 
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ATTTTCCATTGGTGCGGGATCCAAGACCCCAGGGGCTCGACAGCGACTGCAGGCCCGAAGGCAGCACACC 
CGCAAAAAGTAGCCTTTGTCCCCTGTCCCTGTTCCCCCCACCCCTTCCCTAAATCTCGACCTTGGGACCT 
GCTAGGAAGAGCCTTGGACCCTTCCAGTTGCGTAAAGCAAACCTACCCCGGATCTCTGGCTTCAGCCGCC 
AGGGGGCAGTGGCAGCCCTGGGGCCCTTTCCCTTCTGGAGGAAGCACAAGCCTCAGGGAAGGGGAAGCAG 
GATGCGGAGGGCCAAAGCCCGGGACCTCTACTTGAACAGTTCTACTGGGGAGGCTGGAGAACTAAGGAAA 
CACCTGTACATAGTGTCCGCTGCCCTGACTCCCGCTTAGCACACCCTTAGGCAGGCGCCCCTTCCACCTT 
TCCCCGAGACCGTCGTCGCTGGAGGGGGCAGGGTCCAGCCCGCCTGGATCGGTGGTGTGCACCTGATGGG 
ATTTGGGAAATGGGCTATCCGTAAAGCTTTATCTTGCTTGGCTTAGCTGTGAGAAGTGGTTCTCTTCCTC 
TGGTCCCTTCTGGGGACTCTGTTTCCCCATTTCTTGCTGCTGTGTCCCTCACCAGTTCCTTGCAGGATTC 
CTTCGTTTTTAAATGCCCTTGAATCTAGCTTTGCCTTGGAGACCCCAGTGGGTGCTGCTCCTGCCGTTTT 
CTTCCTGCCAAGCCTGAATCAATGTTTCATCTCCAACCC 

CTC AAG ACTGTT AGCC TGGC AGAGC C AGGGGTG AAGGG AGAAGC TC TTGG AGC AGGC AGG ATGCCC ACCG 
CTGCTTCAGCTGCCTCCTCGCCCAGCTACCCTTTGGCCCCATTGGGCCCTCGTCTGCCTCTCCAGGATTG 
T ATGTTTCAAGCC TTGTCCTGTGTTCCTTTGTCTGAC GC TC TGTGTATTGC TCTTTGAATCG AGTTTGGA 
GGAAGAGTTGAGTTGTATGAGTGGCGGCATGTTGGTAGTGCCGGAC 

CTCGCTAATTGAATGTGGAAAGTAGCACCACTTGACGGCTACAAGTGCCGACTCCTGAATTTTCCCATGG 

TGTTCTGACTTCAAGGGCTGGCAGCCAGGGAGAATGGGCCCAGGGGAAGCAAAGACCTCTTCCCTCTGCC 

GTTTC TGTCCC ACTT AACTGACC TC ACTGGAGGCTAC ATC ACCC AAAGTAG ATGTT AGAAAAC CTAAATT 

AATGAACCATATTTTTAAAATCCTATTTTTCCCAAACAGGGCCCTCTGCAGCCCATCCTTTCCTTCCGTC 

CTTCTGAAACCACATACCCCAGGCCCAAGCGCCTTGCTGCCACGCCCAACCTCTTTGGGAGAAGTATGAA 

TGCGTGTGTCTAAATTAAAAGAAAAAAATATTTAAACGTTTTTTAACAAAAATTTATTTTO 

C TAAATTGCC TTTTAAATTCC TTC AAGC TTG GTTC ATTG AGGTGGTT AAGTAT AAATGCTATT AAC T AGG 

AATTAG CTGTATAGTTAAGTTATGCC TGTGC AAAG AAGAGGCTC AAATGCTGTCCC CGGCAGC TTTCC TG 

GGGG AC TAGAGC TCC TTCTGGC C ATGTT AT ATGAAATGTAATTCTTATTTTATAAATAATGTGATGTAAA 

TGTAACTGGTGCCCCCTCCCCGATGTGACTGAGGGTGAGTGAGTGGTGGCGGGGCTGCTCCTTCCCACCC 

C TC AGAAC AGCTC TG ATCCTCGTTAAT AC CTGGC TGC G1X5TGC AGCTGAGG AGGG AGTG AAC CTC AAGCC 

TAAATACCTGTTAGGATTGG AGGGTC TGGGTGGGCC TGGGCCT AGC AATC AAGC TTC TACC TGTACC TTA 

TGT AAGGTAGACCCTCCT AGTGTC AGT ACCTGAGCTAGTTTAC CTC AGTTC C GC AGGC AGG ACAGCC GGT 

CCGGGAACCCTGAGTGAGAATGAGTGTGGATGTGTACAGTACACGCACTGGACGGCAGCGGGAGGCTGGG 

AC TTTC C ATTAC AAATAG AG AC TTC ATTC CTGTTGAGTC TAGTTGG AATTTTT AGTATGAATGTGAGATT 

TTTCTCCTGCTTGTGACATTAAGAATAAAAAACTGTGATCTATCGTAAAAAAAAAAAAAAAAAA 

>gi|23110936|ref |NM_002787.2| Homo sapiens proteasome (prosome, 
macropain) subunit, alpha type, 2 (PSMA2), mRNA 

TTTGGCTCTTCGGGTAAAGATGGCGGAGCC^GGGTACAGCTTTTCGCTGACTACATTCAGCCCGTCTGGT 
AAAC TTGTC C AG ATTG AATATGC TTTGGCTGC TG T AGCTGG AGGAGCC CC GTCCGTGGGAATT AAAGC TG 
CAAATGGTGTGGTATTAGCAACTGAGAAAAAACAGAAATCCATTCTGTATGATGAGCGAAGTGTACACAA 
AGTAG AACC AATTACC AAGC ATATAGGTTTGGTGT AC AGTGGC ATGGGCC C CGATT AC AG AGTGC TTGTG 
CAC AG AGCTCGAAAAC TAGC TCAAC AAT ACTATC TTGTGTACC AAG AACC CATTCCTACAGC TC AGCTGG 
TAC AG AGAGTAGCTTC TGTG ATGC AAG AATAT ACTC AGTC AGGTGGTGTTC GTCC ATTTGG AGTTTC TTT 
ACTTATTTGTGGTTGGAATGAGGGACGACCATATTTATTTCAGTCAGATCCATCTGGAGCTTACTTTGCC 
TGG AAAGC TAC AGC AATGGG AAAG AACT ATGTG AATGGGAAGACTTTCCTTGAG AAAAG AT AT AATGAAG 
ATCTGGAACTTGAAGATGCCATTCATACAGCCATCTTAACCCTAAAGGAAAGCTTTGAAGGGCAAATGAC 
AGAGGATAACATAGAAGTTGGAATCTGCAATGAAGCTGGATTTAGGAGGCTTACTCCAACTGAAGTTAAG 
GATTACTTGGCTGC C AT AGC AT AAC AATG AAGTGACTGAAAAATCC AGAATTTC AG AT AATCT ATC TACT 
T AAAC ATGTTTAAAGTATGTTTTGTTTTGC AGACTTTTTGC AT AC TT ATTTC T AC ATGGTTTAAATCG AC 
TGTTTTTAAAATG AC AC TTATAAATC CT AAT AAAC TGTTAAACCC 

>gi | 23110937 | ref |NM_002788 .2 | Homo sapiens proteasome (prosome, 
macropain) subunit, alpha type, 3 (PSMA3), transcript variant 1, mRNA 
GCGCTCCGGGCCTGGAATCCCTACGCGTCCCTTTGGGTTTAGCACGATGAGCTCAATCGGCACTGGGTAT 
G AC CTGTC AGCCTCTAC ATTCTCTCC TGACGGAAGAGTTTTTCAAGTTG AATATGC T ATGAAGGCTGTGG 
AAAATAGTAGTACAGCTATTGGAATCAGATGCAAAGATGGTGTTGTCTTTGGGGTAGAAAAATTAGTCCT 
TTC T AAAC TTT ATGAAGAAGGTTCC AAC AAAAG ACTTTTTAATGTTG ATC GGCATGTTGGAATGGCAGTA 
GCAGGTTTGTTGGCAGATGCTCGTTCTTTAGCAGACATAGCAAGAGAAGAAGCTTCCAACTTCAGATCTA 
ACTTTGGCTACAACATTCCACTAAAACATCTTGCAGACAGAGTGGCCATGTATGTGCATGCATATACACT 
CTACAGTGCTGTTAGACCTTTTGGCTGCAGTTTCATGTTAGGGTCTTACAGTGTGAATGACGGTGCGCAA 
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CTCTACATGATTGACCCATCAGGTGTTTCATACGGTTATTGGGGCTGTGCCATCGGCAAAGCCAGGCAAG 
C TGC AAAGAC GG AAATAGAGAAGCTTC AGATG AAAGAAATGACC TGCCGTGAT ATCGTT AAAG AAGTTGC 
AAAAATAATTTACATAGTACATGACGAAGTTAAGGATAAAGCTTTTGAACTAGAACTCAGCTGGGTTGGT 
G AATT AAC TAATGG AAG AC ATG AAATTGTTCC AAAAG AT AT AAG AG AAGAAGC AGAG AAAT ATGC TAAGG 
AATCTC TGAAGG AAG AAG ATGAATC AGATGATG AT AAT ATGT AAC ATTT AC TCC AGC ATCT ATTGTATTT 
TAAATTTC TACTCC AGTCC AATGT AACTATTTAGC CCTGG ATT AT AC AT AC TGTCC AATTTTC ATT AAAT 
TTTTGTCTTATAACTATAAAAAAAAAAAAAAAAAAAAAA 

>gi | 22538462 | ref |NM_002793 .2 | Homo sapiens proteasome (prosome, 
macropain) subunit, beta type, 1 (PSMB1) , mRNA 

AAGGCAGCCATCTCGCCGTGAGACAGCAAGTGTCGCGCAGCCGTGCGATGTTGTCCTCTACAGCCATGTA 
TTCGGCTCCTGGCAGAGACTTGGGGATGGAACCGCACAGAGCCGCGGGCCCTTTGCAGCTGCGATTTTCG 
CC C T AC GTTTTC AACGG AGGTACTATACTGGC AATTGC TGG AGAAGATTTTGC AATTGTTGC TTCTG ATA 
CTCGATTGAGTGAAGGGTTTTCAATTCATACGCGGGATAGCCCCAAATGTTACAAATTAACAGACAAAAC 
AGTC ATTGG ATGC AGC GGTTTTC ATGG AG ACTGTCTT AC GCTG AC AAAGATTATTGAAGC AAG AC TAAAG 
ATGTATAAGC ATTCC AATAAT AAGG CC ATGAC T ACGGGGGC AATTGCTGC AATGC TGTCT AC AATCC TGT 
ATTC AAGGCGC TTC TTTCC ATACT ATGTTT AC AAC ATC ATC GGTGG ACTTG ATGAAG AAGGAAAGGGGGC 
TGT AT AC AGCTTTG ATCC AGT AGGGTCTT ACC AG AG AGACTC CTTC AAGGC TGGAGGC TC AGC AAGTGC C 
ATGCTACAGCCCCTGCTTGACAACCAGGTTGGTTTTAAGAACATGCAGAATGTGGAGCATGTTCCGCTGT 
CCTTGGACAGAGCCATGCGGCTGGTGAAAGATGTCTTCATTTCTGCGGCTGAGAGAGATGTGTACACTGG 
GGACGCACTCCGGATCTGCATAGTGACCAAAGAGGGCATCAGGGAGGAAACTGTTTCCTTAAGGAAGGAC 
TGATCTGTGTGCTCTTATCACCAATCAGTTCAGACCTGGTTGATTTTGTACTTTC 
TGGTTTTGTTTATTAAAAGAGAAACCTGAAGT 

>gi | 30152230 | ref | XM_167259 . 2 | Homo sapiens similar to protease 
(prosome. macropain) 26S subunit, ATPase 1 (LOC221890) , mRNA 
ATGCCATC ATGTCTAC ATCTGTGGGCTC AGAGC ATTCTTGT AGAC AAGGATC TGC TGGAACCTGGCTGC T 
CGGTCCTGCTCAACCACAAGGTTCGTGCTGTGATATGGGTGCTGATGGATGACACGGATACCCTAGTCAC 
AATGATGAAGGTGGAAAAGACCCCCCAGGAGACCTGTGTTGATACTCX3GGG 

GAAATTAAGGAAATCATTCTGAATATTATGAAGAGATGGAAATACCTACATGATGGGCCCAAACTCATAT 
GGGAATTGTTTCTAGTTGCTGAAGAACATGCACCTTCCATCATGTTTATTGATGAAATTGATGCTATTAG 
G AC AAAAAG ATGTG AC TC AAATTCTG ATAGTGAGAG AGAAATTC AGCAAATAATGC TGGAAATGTTG AAC 
CAGTTGGATGGATTTGATTCAAGGGGAGATGTGAAAGTTATCATATCCACAAGCCGAATAGAAACTTTGG 
ATC T AGC ACTTATC AG AC C AGGC T AC AC TGACAGG AAGC TCAAGTTCC CCC TGCC TGATG AAAAG ACTAA 
GAAGCACATCTTTCAGATGCACACAAGCAGCATGACACTGGCCAATGATGTAACCCTGGAACGACTTGAC 

ACTTG A 

>gi (27479737 | ref | XM_210528 . 1 | Homo sapiens similar to protease 
(prosome, macropain) 26S subunit, ATPase 1 (Rattus norvegicus) 
(LOC285213) , mRNA 

ATGGGTGAAAGTCAGAGTGGGGGTTATGGTCCTGGAGTTTGCAAGAAGAATGGAAGGACAAGAAAAATAA 
ATATGAACCTCCTGTACCAACTAGAGTGGAGAAGAAGAAAACAAAGGGACCAGATGCTGCCAGCAAACTG 
CCACTGCCACAAGGTGCATGCAGTGATAGGGGTGCTGATGGATGACACGGATCCCCTATTCACAGTGATG 
AAG GTGGAAAAGACTCC CC AGG AG AC C T ATG CTG ATAC TGGGGG AAAAG AAATTAAGGAATC TGTG AAGC 
C TCC TC TC AC C C AAC ATG AATATT ATGAAGAGAC GGGTGTAAAGC CTC CT AC GGGGTGGGGGGTC ATTC T 
C T ATGGTC C AC CTGGC AC AGTTGC TGAAGAAC AC GC ACC ATCC AC TGTGTTTATTGATG ATATTAATGTC 
AC TGGG AC AAAAAG AT ATGACTC AAATTC TGGTGGTG AG AT AGC AATTC AGC GAAC AATGTTGG AAC TGT 
GGAACCAGTTGGATGGATTTGATTCTAGGGGAGATGTGAAAATGCATACAAGCATGATGACACTGGCCGA 
TGATGT AAACC TGG AC GACTTG ATAATG AC TAAAGATG AC C TC TTTGGTGCTG AC ATC AAGGC AATC TGC 
AC AG AAGCTTGCCTG ATGGC C TT AAG AG AGCC AC AG AATG AAGAAAATGTTCTTTGC AAAAAAC AGG AAG 
GCACTCCCAAGGGGCTCTATCTCTAG 

>gi | 29727183 | ref | XM_292848 . 1 | Homo sapiens similar to protease 
(prosome, macropain) 26S subunit, ATPase 1 [Rattus norvegicus] 
(LOC343928) , mRNA 

ATGGGTCAAAGTCAGAGTGGTGGTCATGGTCCTGGAAGTGGCAAGAAGGATGACAAGGACAAAAAAAAGA 
AAT ATG AACC TTCTGT ACC AACTAGAGTGAGG AAAAAG AAGAAGAAAAC AAAGGGAC C AG ATGC AGTC AG 
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CAAACTGCCACTGGTGACACCTCACACTCAGTGCCGGTTAAAATTACTGACATTAGAGAGAATTAAAGAC 
T ATC T TCTC ATGG AGGAAG AATTC ATT AG AAATC AGG AAC AGATG AAAC C ATT AG AAG AAAAG C AAAAGG 
AGGAAAGATC AAAAGTGG ATGATC TGAGGGGG ACCCC G AT AC AAGG ATC TTCTGGAAAC TGGC TGCTC GG 
TCCTGCTC AAC C AC AAGGTGC ATGCCGTGATGGGGTGC TG ATGG ATGACAC GG ATC CCCTGGTC AC AGTG 
ATGAAGATGGAAAAGACCCCCCAGGAGACCTATGCCAATATTGGGGGGCTGGACAACCAAATTCAGGAAA 
TAAAGGAAGC TGTGGAGTTTCC TCTCACCC ATCCTGAATATTATGAAGAGATGGGT AAAAAGCCTC CT AA 
GGGGGTC ATTC TC T ATGGTC C ACCTGGCAC AGGC AAAACCTTGTTATCC AAAGC AGTAGC AAACC AAACC 
TCAGCCACTTTCTTGAGAGTGGTTGGCTCTGAACTTATTCAGAAGTACCTAtf 
TATCCATCATGTTTACTGATGAAATTGGAGCCATTGGGACAAAAAGATATGACTCAAATTC 
CATTGACAGGAAGATCAAGTTCTCCCTGCCTGATGAAAGGACTAAGAAGCGTATCTTTCAGAATCACACA 
AGCAGGATGACACTGGCCGATGAAGTAACCCTGGACGACTTGATCATGGTTAAAGATGACCTCTCTGGTG 
CTGACATCAAGGCAATCTGTACAGAAGCTGGTCTGATGGCCTCAAGAGAACGTAGAATGAAAGTAACGAA 
TG AATTC TTC AAAAAAT ATAAAG AAATGTTCTTT ATAAGAAACAGGAAGGC ACC CCTG AGGGGC TC T ATC 
TCTAGTGAACCACAGCTGCCATCAGGAAAATGGTTGGGCGATTCCTCGACCCCTGAAAAGGATGAGCAAC 
TTGTTCC C AAAGCTGG AG AAG AC ACTCC TG AGGGC T ATTTAGG AC AACTT ATG ACTC AGCTC TTTG AGC A 
GAAAG AGGCC AAAAAGTTC AGC AG AAAAAGC CC TG AAC TC TTGG AAG AGCTGGC TTC AAGCC TGGC TTAG 

>gi (29736609 | ref|XM_293923 . 1 | Homo sapiens similar to protease 
(prosome, macropain) 26S subunit, ATPase 1 (LOC345645), mRNA 
ATGATAGTTGAGGTGGAGG AAC TTCC AGC AGTGGC AGCTC AAGTGGC C AAG AC AAGAGTGGCGG AAAAAG 
AAGAAAAAACAAAGGGACAAGATGTTGCCAGTAAACTGCCACTGGTGACACTTCACACTCAGTGTCGGTT 
AAAATT AC TGAAGTT AG AG AGAATT AAAG ACTAC CTTCTC ATGGTGG AAG AATTC ATTAGAAATC AGGAA 
CAAATAAAACT ATT AG AAG AAAAGC AAG AG GAGGGAAG ATC AAAAGTGGATG ATC TGAGGGGG AC CCCAA 
TGTCAGTAGGAAACTTGGAAGAGATCATCGATGACAATCATGCCATTGTGTCTACATCTGTGGGCTCAGA 
ACACTATGACAGCATTATTTCATTTGTAGAGAAGGATCTGCTGGAACCTGGCTGCTCGATTCTGCTCAGA 
CACAAGGTACATGCGGTGATAGGGGTGCTGATGGATGATACGGGTCCCCTGGTCACAATGATGAAGGTGG 
AGAAGGCC C CC C AGG AG ACC T ATGTCAAT ACTGGGGGGTTGG AC AACC AAATTC AGG AAATTAAGG AATC 
TATGGAGCTTCCTCTCCCCC ATCCTGAATATTATGAAGAGATGGGT AC AAAGC CTCCTAAAGGGGTC ATT 
CTCTGTGGTCC AC C TGGC AC AGC C ACTTTC TTGAG AGTGGTTGGCTC TG AAC TT ATTC AGAAGT AC CT AG 
GTGATGGGCCCAAACTCGTACGGCAAGTATTTCAAGTTGCTGAAGAACATGCACCATCCATCATGTTTA^ 
TGATGAAATTG AAGCC ATTGGGAC AAAAAG ATATGACTC C AATTC TGGTGGTGAG AG AG AAATTC AGC AA 
AC AATGTTGGAATTGG AAC TGTTG AACC AATTGGGTGGATTTG ATTC TAGGGAAGATGTGAAAGTTATC A . 
TGGCCACAAAACAAGTAGAAACTTTGGATCCAGTACTTATCAGACCAGGCCGCATTGACAAGAAGATCGA 
GTTC C ACCTGCC TG ATGAAAAG AC T AAG AAGC AC ATC TTTCAGATTC AC AC AAGCAGGATG AC ACTGGCC 
AATG ATGTAAC C CTGG ACG AC TTGATC ATGGCTAAAG ATG ACTTCTC TGGTGCTGAC ATC AAGGCAATCT 
GT AC AG AAGC TG GTC T G ATGG C C T T AAG AGAAC AT AG AATGAAAGC AAC AAATG AAG AC TTC AAAAAATC 
T AT AG AAAGTGTTC TTT AT AAG AAAC AC GAAGGC ATCCC TGAGGGGCTTTATCTCTAG 

>gi | 29740163 | ref | XM_294034 . 1 | Homo sapiens similar to protease 
(prosome, macropain) 26S subunit, ATPase 1 [Rattus norvegicus] 
(LOC345986) , mRNA 

ATGTCAGTAGGAACCTTGGAAGAGATCAATGACAGTCATGCCATCGTGTCTACATCTGTGGGCTCAGGAC ' 
ACTACGCCAACATTCTTTCATTTGTAGACAAGGATCTGCTGGCAACTGGCTGCTCAGTCCTGCTCAATCA 
CAAGGTGCATGCCATGAGAGGGGTGCTGATGGATGACGTGGATCCCCTGGTCACAGTGATGAAGGTGGAA 
AAGGTC CCCC AGG AGAC CTATGCC AAT AC TGGGAGGTTGGATAATC AAATTC AGG AAATTAAGGAATC TG 
TGGAGC TTCCTCTCACCCATGC TG AAC ATT ATG AAGAGATGGGT AT AAAGC CTCCTAAGGGGGTCATTCT 
CTATGGTCCACCTGGC AT AGTTAAAACCTTGCTAGCCAAAGCGGTAGC AAACC AAACC TC AGC C ACTTTC 
TTG AAAGTGGTTGGCTCTGAAC TTATTC AGAAGT ACC TAGGTGATGGGC C C AAACTTGT ACGGGAATTGT 
TTCAAGTTGCTGGAGAACATGCACCATCCATCGTGTTTACTCATGAAACTGACGCCATTGGGACAAAAAG 
ATATGACTCAAATTCTGGTGAAGCTGGTCTGATGGCCTTAAGAGAACGTAGAATGAAAGCAACAAATGAA 
GACTTCAAAAAATCTAAAGAAAATGTTCTTTATAAGAAACAGGAAGGCACCCCTGAGGGGCTGTCTCTCT 

AG 

>gi [29737531 | ref |XM_294368.1 t Homo sapiens similar to protease 
(prosome, macropain) 26S subunit, ATPase 1 (LOC346682), mRNA 
ATGG TTC AAAGTCAGAGTGGTGGTCATGGTCCTGGAGGTGGAAAGAAGGATGACAAAGACAAGAAAAAGA 
AATAC AAACCTC CTGT ACC AACTAGAGTGGGG AAAAAG AAGAAGAAAAC AG AGGG ATC AT ATGC TGCC AG 
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C AAAC TGCC ACTGGTGAC ACC TC AC AC TC AGTGCCAGTTAAAATTACTG AAAAAAC AG AAACAAATGG AA 
CCACTAGAAGAAAAGCAAGAGGAGGAAAGATCAAAAGTGGATGATCTTAGGGGGACCCCGATGTCAGTAG 
G AACC TTGGAAG AG ATC ATC AATGAC AGTCATGCC ACTGTGTCTAC ATCTGTGGGCTC AGAACGCTATGT 
CAGCATTCTTTCATTTGTAGACAAGGATCTGCTGGAACCTGGCTGCTTAGTCCTGCTCAACCACAAGGTG 
CATGCCATGATAGGGGTGCTGATGGATAACATGGATCCCCTGGTCACAGTGATGAAGGGGTTGGACAACC 
AAAGTCAGGAAATTAAGGAATCTGTGGAGCTTCCTCTCACTCATCCTGAATATTATGAAGAGATGGGTAT 
AAAGCCCCCTAAGGGGGTCATTCACTATGGTCCACCTGGCACAGAATATGCACTGTCCATCATGTTTATT 
GATGAAATTGACGCAATTGGGACAAAAAGATATGACTCAAATTCTGGTAGTGAGAGAGAAATTCAGCGAT 
C AGC ATTGGAACTGTTGAACC AGTC AGATGG ATTTG ATTC TAGGGGAGATGTGAAAGTT ATC ATGGC C AC 
AAAC CAAATAG AAACTTTGGATC C AGC ACTTATC AGAC C AGGCTGC ATTGAC AGG AAG ATTGAGTTC CCC 
CTTCATGATGAAAATACTAAGAAGCGCATCTTTCAGATTCACACAATCAGGATGACGCTGGCCAATGATG 
TAAACCTGG ACG ACTTG ATCGTGACT AGAG ATG AC CTC TCTGGTGCTG AC ATC AAGGC AATC TGT AC AGA 
AGTTGGTCTGATGGCCTCAAGAGAATGTAGAATGCAAGTAAAAAATGAAGACTTCCAAAAGTCTAAAGAA 

AATGTTC TTTAT AAG AAAC AGGAAGGC ACCCCTG AG AGGC TCT ATC TC TAG 

>gi | 24430150 | ref | NM_002802 . 2 | Homo sapiens proteasome (prosome, 
macropain) 26S subunit, ATPase r 1 (PSMCl) , mRNA 

AGTGGTGGAGGAACTTCCGGCAGCGGCAGCTCAAGTGGCCAAGGCAAGATGGGTCAAAGTCAGAGTGGTG 
GTCATGGTCCTGGAGGTGGCAAGAAGGATGACAAGGACAAGAAAAAGAAATATGAACCTCCTGTACCAAC 
TAGAGTGGGGAAAAAGAAGAAGAAAACAAAGGGACCAGATGCTGCCAGCAAACTGCCACTGGTGACACCT 
CACACTCAGTGCCGGTTAAAATTACTGAAGTTAGAGAGAATTAAAGACTATCTTCTCATGGAGGAAGAAT 
TC AT T AGAAATC AG G AAC AAATGAAAC C ATT AG AAG AAAAGC AAG AGG AGG AAAG ATCAAAAGTGGATG A 
TC TG AGGGGGACCCCG ATGTC AGTAGG AAC CTTGGAAGAG ATC ATTGATG AC AATC ATGC CATC GTGTC T 
AC ATCTGTGGGCTC AG AAC AC TACGTC AGC ATTCTTTC ATTTGTAGAC AAGGATC TGC TGGAACC TGGCT 
GCTCGGTCCTGCTCAACCACAAGGTGCATGCCGTGATAGGGGTGCTGATGGATGACACGGATCCCCTGGT 
CACAGTGATGAAGGTAGAAAAGGCCCCCCAGGAGACCTATGCAGATATTGGGGGGTTGGACAACCAAATT 
C AGG AAATT AAGGAATC TGTGG AGCTTC C TC TC ACC CATC C TG AATATTATG AAGAG ATGGCf AT AAAGC 
CTCCTAAGGGGGTCATTCTCTATGGTCCACCTGGCACAGGTAAAACCTTGTTAGCCAAAGCAGTAGCAAA 
CC AAACCTC AGCCACTTTCTTG AG AGTGGTTGGC TC TGAAC TTATTC AG AAGT AC C TAGGTGATGGGCCC 
AAACTCGT AC GGGAATTGTTCC GAGTTGC TGAAG AAC ATGC ACCGTCC ATC GTGTTTATTGATGAAATTG 
ACGCCATTGGGACAAAAAGATATGACTCCAATTCTGGTGGTGAGAGAGAAATTCAGCGAACAATGTTGGA 
ACTGCTGAACCAGTTGGATGGATTTGATTCTAGGGGAGATGTGAAAGTTATCATGGCCACAAACCGAATA 
GAAACTTTGGATCCAGCACTTATCAGACCAGGCCGCATTGACAGGAAGATTGAGTTCCCCCTGCCTGATG 
AAAAG ACG AAG AAGC GC ATC TTTC AGATTC AC AC AAGC AGGATGAC GCTGGCTGATGATGT AAC CCTGG A 
CGACCTGATCATGGCTAAAGATGACCTCTCTGGTGCTGACATCAAGGCAATCTGTACAGAAGCTGGTCTG 
ATGGCCTTAAGAGAACGTAGAATGAAAGTAACAAATGAAGACTTCAAAAAATCTAAAGAAAATGTTCTTT 
ATAAGAAACAGGAAGGCACCCCTGAGGGGCTGTATCTCTAATGAACCATGGCTGTCATCAGGAAAATGGT 
TGGGAGATTTCTCAATCCCTGAAAGGGATGAGGTTGGGGGAGTTGCCCAGAGGAATCCCTGTTCCCACTG 

ATTTTTATTAGCAAAACATCCTGTGTCTTTTGG 
TCACTGTGCAGCAGTCTGCTTCCCAATAAAGCGTGCTCTTTCACAA 

>gi | 25777601 | ref |NM_002808 . 3 | Homo sapiens proteasome (prosome, 
macropain) 26S subunit, non-ATPase, 2 ( PSMD2 ) , mRNA 

TGCGCGCGCAGCGGGCCGGCAGTGGCGGCGGAGATGGAGGAGGGAGGCCGGGACAAGGCGCCGGTGCAGC 
CCCAGCAGTCTCCAGCGGCGGCCCCCGGCGGCACGGACGAGAAGCCGAGCGGCAAGGAGCGGCGGGATGC 
CGGGGACAAGGACAAAGAACAGGAGCTGTCTGAAGAGGATAAACAGCTTCAAGATGAACTGGAGATGCTC 
GTGGAAC G AC TAGGGGAGAAGG AT AC ATC C C TGT ATCG ACC AGCGC TGGAGGAATTGCGAAGGC AGATTC 
GTTCTTCTACAACTTCCATGACTTCAGTGCCCAAGCCTCTCAAATTTCTGCGTCCACA 
GAAGGAAATCTATGAG AAC ATGGC C C C TGGGG AG AATAAGC GTTTTGC TGC TG AC ATC ATC TC C GTTTTG 
GCCATGACCATGAGTGGGGAGCGTGAGTGC CTC AAGTATCGGC TAGTGGGC TC C C AGG AGG AATTGGC AT 
CATGGGGTCATGAGTATGTCAGGCATCTGGCAGGAGAAGTGGCTAAGGAGTGGCAGGAGCTGGATGACGC 
AGAGAAGGTCCAGCGGGAGCCTCTGCTCACTCTGGTGAAGGAAATCGTCCCCTATAACATGGCCCACAAT 
GCAG AGCATG AGGCTTGC G ACCTGC TTATGGAAATTG AGC AGGTGGACATGCTGGAGAAGGAC ATTGATG 
AAAATGC AT ATGC AAAGGTC TGCCTTTATC TC ACC AGTTGTGTG AATTACGTGCC TGAGCCTGAGAACTC 
AGCCCTACTGCGTTGTGCCCTGGGTGTGTTCCGAAAGTTTAGCCGCTTCCCTGAAGCTCTGAGATTGGCA 

TTGATGCTC AATGAC ATGGAGTTGGTAG AAG AC ATCTTC ACCTC C TGC AAGGATGTGGT AGTAC AGAAAC 
AGATGGCATTCATGCTAGGCCGGCATGGGGTGTTCCTGGAGCTGAGTGAAGATGTCGAGGAGTATGAGGA 
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CCTGACAGAGATCATGTCCAATGTACAGCTCAACAGCAACTTCTTGGCCTTAGCTCGGGAGCTGGACATC 

ATCGAGCCCAAGGTGCCTGATGACATCTACAAAACCCACCTAGAGAACAACAGGTTTGGGGGCAGTGGCT 

CTCAGGTGGACTCTGCCCGCATGAACCTGGCCTCCTCTTTTGTGAATGGCTTTGTGAATC 

C C AAG AC AAGCTGC T AAC AGATG ATGGC AAC AAATGGCTTTAC AAGAAC AAGG ACCACGG AATGTTG AGT 

GCAGCTGCATCTCTTGGGATGATTCTGCTGTGGGATGTGGATGGTGGCCTCACCCAGATTGACAAGTACC 

TGTACTCCTCTGAGGACTACATTAAGTCAGGAGCTCTTCTTGCCTGTGGCATAGTGAACTCTGGGGTCCG 

GAATG AGTGTG AC CC TGCTC TGGC ACTGCTC TC AG ACT ATGTTCTCCAC AAC AGC AAC AC C ATGAGACTT 

GGTTCCATCTTTGGGCTAGGCTTGGCTTATGCTGGCTCAAATCGTGAAGATGTCCTAACACTGCTGCTGC 

CTGTGATGGGAGATTCAAAGTCCAGCATGGAGGTGGCAGGTGTCACAGCTTTAGCCTGTGGAATGATAGC 

AGTAGGGTC CTGC AATGG AG ATGTAAC TTC C ACT ATCC TTC AGACC ATC ATGG AG AAGTC AGAGACTG AG 

CTCAAGGATACTTATGCTCGTTGGCTTCCTCTTGG 

TCGAGGCAATCCTGGCTGCACTGGAGGTTGTGTCAGAGCCATTCCGCAGTTTTGCCAACACACTGGTGGA 
TGTGTGTGC ATATGC AGGCTCTGGGAATGTGCTG AAGGTGC AGC AGCTGC TC C ACATTTGTAGCGAAC AC 
TTTGACTCCAAAGAGAAGGAGGAAGACAAAGACAAGAAGGAAAAGAAAGACAAGGACAAGAAGGAAGCCC 
CTGCTGACATGGGAGCACATC^GGGAGTGGCTGTTCTGGGGAT^ 

TGGTGCAGAGATGGCATTACGAACCTTTGGCCACTTGCTGAGATATGGGGAGCCTACACTCCGGAGGGCT 

GTACC TTTAGC ACTGGCC CTCATCTCTGTTTC AAATCC AC GAC TC AAC ATCCTGG AT ACCC TAAGC AAAT 

TCTCTCATGATGCTGATCCAGAAGTTTCCTATAACTCCATTTTTGCCATGGGCATGGTGGGCAGTGGTAC 

CAATAATGCCCGTCTGGCTGCAATGCTGCGCCAGTTAGCTCAATATCATGCCAAGGACCCAAACAACCTC 

TTCATGGTGCGCTTGGCACAGGGCCTGACACATTTAGGGAAGGGCACCCTTACCCTCTGCCCCTACCACA 

GCGACCGGCAGCTTATGAGCCAGGTGGCCGTGGCTGGACTGCTCACTGTGCTTGTCTCTTTCCTGGATGT 

TCGAAACATTATTCTAGGCAAATCACACTATGTATTGTATGGGCTGGTGGCTGCCATGCAGCCCCGAATG 

CTGGTTAC GTTTG ATG AGGAGCTGCGGCC ATTGCC AGTGTC TGTCC GTGTGGGCC AGGC AGTGG ATGTGG 

TGGGCC AGGC TGGC AAGCC GAAG ACTATC AC AGGGTTCC AGACGC AT AC AACCCC AGTGTTGTTGGC CC A 

CGGGGAACGGGCAGAATTGGCCACTGAGGAGTTTCTTCCTGTTACCCCCATTCTGGAAGGTTTTGTTATC 

CTTCGGAAGAACCCCAATTATGATCTCTAAGTGACCACCAGGGGCTCTGAACTGCAGCTGATGTTATCAG 

CAGGCCATGCATCCTGCTGCCAAGGGTGGACACGGCTGCAGACTTCTGGGGGAATTGTCGCCTCCTGCTC 

TTTTGTTACTGAGTGAGATAAGGTTGTTCAATAAAGACTTTTATCCCCAAGGAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

>gi | 19718756 | ref |NM_006267 . 3 1 Homo sapiens RAN binding protein 2 
(RANBP2) , mRNA 

CACAGTGGTCCTCCGCCGGCTACGGCGCTGCGTCACTGGTTTGCAGGCGCTTTCCTCTTGGAAGTGGCGA 
CTGCTGCGGGCCTGAGCGCTGGTCTCACGCGCCTCGGGAGCCAGGTTGGCGGCGCGATGAGGCGCAGCAA 
GGCTGACGTGGAGCGGTACATCGCCTCGGTGCAGGGCTCCACCCCGTCGCCTCGACAGAAGTCAATGAAA 
GG ATTCTATTTTGCAAAGC TGTATTATGAAGCTAAAGAAT ATG ATC TTGC TAAAAAAT AC AT ATGTACTT 
ACATTAATGTGCAAGAGAGGGATCCCAAAGCTCACAGATTTCTGGGTCTTCTTTATGAATTGGAAGAAAA 
C AC AG AC AAAGC CGTTG AATGTT AC AGGC GTTC AGTGG AATTAAACCC AACACAAAAAG ATC TTGTGTTG 
AAG ATTGCAGAATTGC TTTGTAAAAATGATGTTACTG ATGG AAGAGCAAAATACTGGC TTGAAAG AGC AG 
CCAAACTTTTCCCAGG AAGTC CTGCAATTTATAAACTAAAGGAACAGCTTCT AG ATTGTGAAGGTGAAGA 
TGG ATGG AAT AAACTTTTTGACTTGATTC AGTC AG AACTTT ATGTAAGACCTG ATG AC GTCC ATGTGAAC 
ATCCGGCTAGTGGAGGTGTATCGCTCAACTAAAAGATTGAAGGATGCTGTGGCCCACTGCCATGAGGCAG 
AGAGGAACATAGCTTTGCGTTCAAGTTTAGAATGGAATTCGTGTGTTGTACAGACCCTTAAGGAATATCT 
GGAGTCTTTACAGTGTTTGGAGTCTGATAAAAGTGACTGGCGAGCAACCAATACAGACTTACTGCTGGCC 
TATGCTAATCTTATGCTTCTTACGCTTTCCACTAGAGATGTGCAGGAAAGTAGAGAATTACTGCAAAGTT 
TTGATAGTGCTCTTC AGTCTGTGAAATC TTTGGGTGG AAATG ATGAACTGTC AGC TAC TTTCTT AGAAAT 
G AAAGG AC ATTTCTAC ATGC ATGCTGGTTCTCTGC TTTTG AAG ATGGGTC AGC AT AGTAGTAATGTTC AA 
TGGCGAGCTCTTTCTGAGCTGGCTGCATTGTGCTATCTCATAGCATTTCAGGTTCCAAGACCAAAGATTA 
AATTAATAAAAGGTG AAGCTGGAC AAAATCTGCTGG AAATGATGGC C TGTG AC C GACTG AGCC AATC AGG 
GC AC ATGTTGCTAAACTT AAGTC GTGGC AAGC AAGATTTTTT AAAAGAG ATTGTTGAAACTTTTGC C AAC 
AAAAGC GGGCAGTCTGC ATT AT ATGATGC TC TGTTTTC TAGTC AGTC AC C TAAGGATAC ATC TTTTCTTG 
GT AGC G ATG AT ATTGG AAAC ATTG ATGTACG AG AAC CAGAGC TTG AAGATTTGACTAGATACG ATGTTGG 
TGCTATTCGAGCACATAATGGTAGTCTTCAGCACCTTACTTGGCTTGGCTTA<^GTGGAATTCATTGCCT 
GCTTTACCTGGAATCCGAAAATGGCTAAAACAGCTTTTCCATCATTTGCCCCATGAAACCTCAAGGCTTG 
AAACAAATGCACCTGAATCAATATGTATTTTAGATCTTGAAGTATTTCTCCTTGGAGTAGTATATACCAG 
CC ACTTAC AATT AAAGGAG AAATGT AATTC TC ACC AC AGC TCC TATCAGCCGTT ATGCC TGC CC CTTC C T 
GTGTGTAAACAGCTTTGTACAGAAAGACAAAAATCTTGGTGGGATGCGGTTTGTACTCTGATTCACAGAA 
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AAGC AGT AC C TG GAAAC GT AGC AAAATTG AG AC T TC TAG TTC AGC ATG AAAT AAAC AC TC T AAG AG C CC A 
GGAAAAACATGGCCTTCAACCTGCTCTGCTTGTACATTGGGCAGAATGCCTTCAGAAAACGGGCAGCGGT 
CTTAATTCTTTTTATG ATC AAC GAGAATAC AT AGGG AGAAGTGTTC ATTATTGG AAG AAAGTTTTGC C AT 
TGTTGAAGATAATAAAAAAGAAGAACAGTATTCCTGAACCTATTGATCCTCTGTTTAAACATO 
TGTAGACATTCAGGCATCAGAAATTGTTGAATATGAAGAAGACGCACACATAACTTTTGCTATAT^ 
GCAGTAAATGGAAATATAGAAGATGCTGTGACTGCTTTTGAATCTATAAAAAGTGTTGTTTCTTATTGGA 
ATCTTGCACTGATTTTTCACAGGAAGGCAGAAGACATTGAAAATGATGCCCTTTCTCCTGAAGAACAAGA 
AG AATGC AAAAATTATC TGAGAAAGACCAGGGACTACCTAATAAAG ATT ATAGATGACAGTGATTC AAAT 
CTTTC AGTGGTC AAG AAATTGC C TGTGCCCCTGGAGTCTGT AAAAG AGATGCTTAATTC AGTCATGC AGG 
AACTCGAAGACTATAGTGAAGGAGGTCCTCTCTATAAAAATGGTTCTTTGCGAAATGCAGATTCAGAAAT 
AAAAC ATTC T ACACC GTC TCCT ACC AAATATTC ACTATC ACC AAGTAAAAGTTAC AAGT ATTC TCCC AAA 
AC ACC ACC TCG ATGGGC AGAAG ATC AGAATTC TTT AC TGAAAATGATTTGCC AAC AAGT AG AG GC CATTA 
AGAAAGAAATGCAGGAGTTGAAACTAAATAGCAGTAACTCAGCATCCCCTCATCGTTGGCCCACAGAGAA 
TTATGGACCAGACTCGGTGCCTGATGGATATCAGGGGTCACAGACATTTCATGGGGCTCCACTAACAGTT 
GC AACTACTGGCCC TTC AGT ATATTATAGTC AGTC ACC AGC AT AT AATTCCC AGT ATC TTCTC AG AC C AG 
CAGCTAATGTTACTCCCACAAAGGGCCCAGTCTATGGCATGAATAGGCTTCCACCCCAACAGCATATTTA 
TGC C TATCCGC AAC AGATGCAC AC AC CGCC AGTGC AAAGC TC ATC TGC TTGT ATGTTC TCT C AGG AGATG 
TATGGTCCTCCTGCATTGCGTTTTGAGTCTCCTGCAACGGGAATTCTATCGCCCAGGGGTGATGATTACT 
TTAATTAC AATGTTCAAC AGAC AAGC AC AAATCC ACCTTTGCC AGAACC AGGAT ATTTC AC AAAACCTC 
GATTGC AGC TC ATGCTTC AAG ATC TGCAGAATC TAAGAC T AT AG AATTTGGGAAAACTAATTTTGTTC AG 
CC C ATGC CGGGTG AAGG ATT AAGGCC ATC TTTGC C AAC AC AAGC AC AC AC AAC AC AGC C AACTCCTTTTA 
AATTTAACTCAAATTTCAAATCAAATGATGGTGACTTCACGTTTTCCTCACCACAGGTTGTGACACAGCC 
CCCTCCTGC AGCTT AC AGT AAC AGTGAAAGC C TTTTAGGTC TCC TGAC TTC AGATAAACCCTTGCAAGG A 
GATGGCTATAGTGGAGCCAAACCAATTCCTGGTGGTCAAACCATTGGGCCTCGAAATACATTCAATTTTG 
GAAGCAAAAATGTGTCTGGAATTTCATTTACAGAAAACATGGGGTCGAGTCAGCAAAAGAATTCTGGTTT 
TCGGC G AAGTGATGAT ATGTTTACTTTCC ATGGTCC AGGGAAATC AGTATTTGG AAC ACCC ACTTT AG AG 
ACAGCAAACAAGAATCATGAGACAGATGGAGGAAGTGCCCATGGGGATGATGATGATGACGGTCCTCACT 
TTG AGC CTGT AGT AC CTC TTCC TGATAAGATTG AAGTAAAAACTGGTGAGGAAG ATG AAG AAGAATTCTT 
TTGCAACCGCGCGAAATTGTTTCGTTTCGATGTAGAATCCAAAGAATGGAAAGAACGTGGGATTGGCAAT 

GTAAAAAT AC TGAGGC AT AAAAC ATC TGGTAAAATTCGCC TTCTAATG AGAC GAG AGC AAGTATTG AAAA 
TCTGTGCAAATCATTACATCAGTCCAGATATGAAATTGACACCAAATGCTGGATCAGACAGATCTTTTGT 
ATGGCATGCCCTTGATTATGCAGATGAGTTGCCAAAACCAGAACAACTTGCTATTAGGTTCAAAACTCCT 
GAGGAAGC AGC AC TTTTT AAATGC AAGTTTGAAGAAGCCC AGAGC ATTTTAAAAGC CCC AGGAAC AAATG 
TAGCCATGGC GTC AAATC AGGCTGTC AGAATTGTAAAAG AACCCACAAGTC ATGATAAC AAGG ATATTTG 
CAAATCTGATGCTGGAAACCTGAATTTTGAATTTCAGGTTGCAAAGAAAGAAGGGTCTTGGTGGCATTGT 
AAC AGC TGC TC ATT AAAG AATG C TTC AAC TG C T AAG AAATG TGT ATC ATG C C AAAATC T AAAC C C AAGC A 
AT AAAG AGC TCGTTGGC C C ACC ATTAGC TGAAACTGTTTTT ACTCCT AAAACC AGC CC AGAGAATGTTCA 
AG ATC GATTTGCATTGGTGACTCC AAAG AAAG AAGGTC ACTGGG ATTGTAGT ATTTGTTT AGTAAG AAAT 
G AAC C TACTGT ATC T AGGTGC ATTGCGTGTC AG AAT AC AAAATCTGCT AAC AAAAGTGG ATCTTC ATTTG 
TTC ATC AAGCTTC ATTTAAATTTGGC C AGGG AG ATC TTC CTAAACCTATTAACAGTG ATTTC AG ATC TGT 
TTTTTCTACAAAGGAAGGACAGTGGGATTGCAGTGCATGTTTGGTACAAAATGAGGGGAGCTCTACAAAA 

TGTGC TGCTTGTCAGAATCCG AGAAAAC AGAGTC TAC CTGC T AC TTCT ATTCC AAC AC C TGC CTCTTTT A 
AGTTTGGTACTTCAGAGACAAGTAAAACTCTAAAAAGTGGATTTGAAGACATGTTTGCTAAGAAGGAAGG 
ACAGTGGGATTGCAGTTCATGCTTAGTGCGAAATGAAGCAAATGCTACAAGATGTGTTGCTTGTCAGAAT 
CCGGATAAACCAAGTCCATCTACTTCTGTTCCAGCTCCTGCCTCTTTTAAGTTTGGTACTTCAGAGACAA 
GC AAGGCTCC AAAGAGC G G ATTTG AGG GAATGTTC AC TAAGAAGGAGGG AC AGTGGG ATTGC AGTGTGTG 
C TTAGT AAGAAATGAAGC C AGTGC TAC CAAATGTATTGCTTGTCAGAATCCAGGT AAAC AAAATC AAAC T 
ACTTCTGCAGTTTCAACACCTGCCTCTTCAGAGACAAGCAAGGTCTCCAAAGAGCGGATTTGAGGGAATGT 
TCACTAAGAAGGAGGGACAGTGGGATTGCAGTGTGTGCTTAGTAAGAAATGAAGCCAGTGCTACCAAATG 
TATTGCTTGTCAGAATCCAGGTAAACAAAATCAAACTACTTCTGCAGTTTCAACACCTGCCTCTTCAGAG 
ACAAGCAAGGCTCCAAAGAGCGGATTTGAGGGAATGTTCACTAAGAAGGAAGGACAGTGGGATTGCAGTG 
TGTGCTTAGTAAGAAATGAAGCCAGTGCTACCAAATGTATTGCTTGTCAGTGTCCAAGTAAACAAAATCA 
AAC AAC TG C AATTTC AAC AC C TGC C TC TTCGGAGATAAGC AAG GC TCC AAAG AGTGG ATTTG AAGG AATG 
TTC ATC AGG AAAGGAC AGTGGGATTGTAGTGTTTGC TGTGTAC AAAATGAG AGTTCTTCC TTAAAATGTG 
TGGC TTGTG ATGCCTC TAAACCAACTC ATAAACC TATTGC AG AAGCTC C TTC AGC TTTC AC AC TGGGCTC 
AG AAATG AAGTTGCATGAC TC TTC TGGAAGTC AGGTGGG AAC AGG ATTTAAAAGTAATTTC TC AG AAAAA 
GC TTC TAAGTTTGGCAATACAGAGC AAGG ATTC AAATTTGGGCATGTGGATC AAG AAAATTCACCTTC AT 
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TT ATGTTTC AGGGTTC TTCTAAT AC AGAATTT AAGTC AACC AAAGAAGG ATTTTC C ATCCCTGTGTC TGC 

TGATGGATTTAAATTTGGCATTTCGGAACCAGGAAATCAAGAAAAGAAAAGTGAAAAGCCTCTTGAAAAT 

GGTACTGGCTTCCAGGCTCAGGATATTAGTGGCCAGAAGAATGGCCGTGGTGTGATTTTTGGCCAAACAA 

GTAGCACTTTTACATTTGCAGATCTTGCAAAATCAACTTCAGGAGAAGGATTTCAGTTTG 

CCCCAATrrCAAGGGATTTTCAGGTGCTGGAGAAAAATTATTCTCATCACAATACGGTAAAATGGCCAAT 

AAAGC AAACACTTCCGGTGAC TTTG AG AAAGATG ATG ATGCC T AT AAG ACTGAGGAC AGCG ATG ACATCC 

ATTTTGAACCAGTAGTTCAAATGCCCGAAAAAGTAGAACTTGTAACAGGAGAAGAAGATGAAAAAGTTCT 

GTATTCACAGCGGGTAAAACTATTTAGATTTGATGCTGAGGTAAGTCAGTGGAAAGAAAGGGGCTTGGGG 

AACTTAAAAATTCTCAAAAACGAGGTCAATGGCAAACTAAGAATGCTGATGCGAAGAGAACAAGTACTAA 

AAGTGTGTGCTAATCATTGGATAACGACTACGATGAACCTGAAGCCTCTCTCTGGATCAGATAGAGCATG 

GATGTGGTTAGC C AGTGATTTC TCTGATGGTG ATGC C AAAC TAGAGCAGTTGGC AGC AAAATTT AAAAC A 

CCAGAGCTGGCTGAAGAATTCAAGCAGAAATTTGAGGAATGCCAGCGGCTTCTGTTAGACATACCAC 

AAAC TCCCC AT AAACTTGTAGAT AC TGGC AG AGCTGCC AAGTT AATAC AGAGAGC TGAAGAAATG AAGAG 

TGGACTGAAAGATTTCAAAACATTTTTGACAAATGATCAAACAAAAGTCACTGAGGAAGAAAATAAGGGT 

TCAGGTACAGGTGCGGCCGGTGCCTCAGACACAACAATAAAACCCAATCCTGAAAACACTGGGCCCACAT 

TAGAATGGGATAACTATGATTTAAGGGAAGATGCTTTGGATGATAGTGTCAGTAGTAGCTCAGTACATGC 

TTCTCCATTGGCAAGTAGCCCTGTGAGAAAAAATCTTTTCCGTTTTGGTGAGTCAACAACAGGATTTAAC 

TTCAGTTTTAAATCTGCTTTGAGTCCATCTAAGTCTCCTGCCAAGTTGAATCAGAGTGGGACTTCAGTTG 

GC AC TGATGAAGAATCTG ATGTT ACTCAAGAAG AAG AGAGAG ATGG AC AGT AC TTTGAACC TGTTGTTC C 

TTTAC CTGATC TAGTTG AAGTATC CAGTGGTGAGG AAAATGAAC AAGTTGTTTTTAGTC AC AGGGC AAAA 

CTCTACAGATATGATAAAGATGTTGGTCAATGGAAAGAAAGGGGCATTGGTGATATAAAGATTTTACAGA 

ATTATGATAATAAGCAAGTTCGTATAGTGATGAGAAGGGACCAAGTATTAAAACTTTGTGCCAATCACAG 

AATAACTCCAGACATGACTTTGCAAAATATGAAAGGGACAGAAAGAGTATGGTTGTGGACTGCATGTGAT 

TTTGC AGATGG AG AAAG AAAAGTAGAGC ATTTAGC TGTTCGTTTT AAACTAC AGGATGTTGC AGACTCGT 

TTAAGAAAATTTTTGATGAAGCAAAAACAGCCCAGGAAAAAGATTCTTTGATAACACCTCATGTTTCTCG 

GTCAAGCACTCCCAGAGAGTCACCATGTGGCAAAATTGCTGTAGCTGTATTAGAAGAAACCACAAGAGAG 

AGGACAGATGTTATTCAGGGTGATGATGTAGCAGATGCAACTTCAGAAGTTGAAGTGTCTAGCACATCTG 

AAAC AAC AC C AAAAGC AGTGGTTTC TCC TCC AAAGTTTGTATTTGGTTC AGAGTC TGTT AAAAGC ATTTT 

T AGT AGTGAAAAATC AAAAC C ATTTGC ATTC GGC AACAGTTC AGCC ACTGOT 

TTT AATGC AC C TTTG AAAAGT AAC AATAG TG AAACT AGTTC AGT AGC CC AG AGTGGATC TG AAAGC AAAG 
TGGAACCTAAAAAATGTGAACTGTCAAAGAACTCTGATATCGAACAGTCTTCAGATAGCAAAGTCAAAAA 
TCTC TTT GCTTCCTTTCCAACGGAAGAATCTTCAATCAACTACACATTTAAAACACCAGAAAAGGC AAAA 
GAGAAGAAAAAACCTGAAGATTCTCCCTCAGATGATGATGTTCTCATTGTATATGAACTAACTCCAACCG 
CTGAGCAGAAAGCCCTTGCAACCAAACTTAAACTTCCTCCAACTTTCTTCTGCTACAAGAATAGACCAGA 
TTATGTTAGTGAAGAAGAGGAGG ATG ATGAAGATTTC G AAAC AGC TG TC AAGAAAC TTAATGG AAAACT A 
TATTTGGATGGCTCAGAAAAATGTAGACCCTTGGAAGAAAATACAGCAGATAATGAGAAAGAATGTATTA 
TTGTTTGGGAAAAGAAACC AAC AGTTGAAGAGAAGGC AAAAGC AG AT ACGTTAAAAC TTCC AC CTAC ATT 
TTTTTGTGGAGTCTGTAGTGATACTGATGAAGACAATGGAAATGGGGAAGACTTTCAATCAGAGCTTCAA 
AAAGTTCAGGAAGCTCAAAAATCTCAGACAGAAGAAATAACTAGCACAACTGACAGTGTATATACAGGTG 
GG ACTG AAGTG ATGGT ACC TTC TTTCTGTAAATCTGAAGAACCTGATTC TATTACC AAATCC ATT AG TTC 
AC C ATC TGTTTCCTC TG AAACT ATGGACAAACC TGTAG ATTTGTC AACTAGAAAGGAAATTGATACAG AT 
TC T AC AAG C C AAGG GG AAAGC AAG AT AG TT TCATTTGGATTTGGAAGT AGC AC AGGGC TC TC AT TT GC AG 
ACTTGGCTTCCAGTAATTCTGGAGATTTTGCTTTTGGTTCTAAAGATAAAAATTTCCAATGGGCAAATAC 
TGGAGCAGCTGTGTTTGGAACACAGTCAGTCGGAACCCAGTCAGCCGGTAAAGTTGGTGAAGATGAAGAT 
GGTAGTGATGAAGAAGTAGTTCATAATGAAGATATCCATTTTGAACCAATAGTGTCACTACCAGAGGTAG 
AAGTAAAATCTGGAGAAGAAGATGAAGAAATTTTGTTTAAAGAGAGAGCCAAACTTTATAGATGGGATCG 
GGATGTC AGTC AGTGGAAGGAGC GCGGTGTTGG AGAT ATAAAGATTC TTTGGC AT AC AATG AAG AATTAT 
T ACC GGATC CTAATGAGAAGAGACCAGGTTTTTAAAGTGTGTGC AAAC CACGTTATTAC TAAAACAATGG 
AATT AAAGC C CTT AAATGTTTC AAAT AATGCTTTAGTTTGGACTGC C TC AGATTATGC TGATGG AG AAGC 
AAAAGTAGAACAGCTTGCAGTGAGATTTAAAACTAAAGAAGTAGCTGATTGTTTCAAGAAAACATTTGAA 
G AATGTC AGC AG AATTTAATG AAACTCC AGAAAGGAC ATGTATC ACTGGC AGC AG AATT ATC AAAGG AGA 
CC AATCC TGTGGTGTTTTTTG ATGTTTGTGCGGAC GGTG AAC C TCT AGGGC GG AT AACTATGGAATTATT 
TTCAAACATTGTTCCTCGGACTGCTGAGAACTTCAGAGCACTATGCACTGGAGAGAAAGGCTTTGGTTTC 
AAG AATTC C ATTTTTC AC AGAG T AATT C C AGAT TTTG TTTG C C AAG G AG GAGAT ATC AC C AAAC ATG ATG 
GAAC AGGC GG AC AGTC C ATTT ATGGAGAC AAATTTG AAGATG AAAATTTTG ATGTG AAAC ATACTGGTCC 
TGGTTTACTATCCATGGCC AATC AAGGCCAGAATACCAAT AATTC TCAATTTGTTAT AAC AC TGAAG AAA 
GCAGAACATTTGGACTTTAAGCATGTAGTATTTGGGTTTGTTAAGGA'rGGCATGGATACTGTGAAAAAGA 
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TTG AATC ATTTG GTTC TCCC AAAGGGTC TGTTTGTC G AAG AAT AAC T ATC AC AGAATGTGG AC AG ATAT A 
AAATC ATTGTTGTTC ATAG AAAATTTC ATCTGT AT AAGC AGTTGG ATTGAAGC TT AGC TATTAC AATTTG 
ATAGTTATGTTCAGCTTTTGAAAATGGACGTTTCCGATTTACAAATGTAAAATTGCAGCTTATAGCTGTT 
GTCACTTTTTAATGTGTTATAATTGACCTTGCATGGTGTGAAATAAAAGTTTAAACACTGGTGAT^ 
GTGTACTTGTGTTTATGTACTCCTGACGTATTAAAATGGAATAATACTAATCTTGTTAAAAGCAATAGAC 
C TC AAAC T ATTG AAGGAATATGAT AT ATGC AATTTAATTTTAATTCCTTTT AAGATATTTGG AC 
CATGGATATACTTACCATTTGAATAAAGGGACCACAACTTGGATAATTTAATTTTAGGTTTGAAATATAT 
TTGGTAATCTTAACTATTGGTGTACTCATTTATGCATAGAGACTCGTTTATGAATGGGTAGAGCCACAGA 
ACGTATAGAGTTAACCAAAGTGCTCTTCTCTAGAATCTTTACACCTCCTGTGTGGTTACAAGTTAACTTT 
GTAAGTAGCGTACCTTCCTTCCTTAAAATATCTAGCTTCCTGTGCCCTTTCATAGATATTCGATTAATTT 
TTAC ATTTTAAAC AAGTTG ACT ATTTC CTTTAGGGGTTTTGTTTCAAAC TTTTC TGTC ATC TGTC TCTAC 
T ACC TC AG AAACTGC AGC TTGGTTCTG ATG AT AG AAATTGAATTTTTCC TTGT AGTTATTGTGAT AAAGT 
ATGAATATTTTTAGAAAGTCTAT AC C ATGTTC TTTC GTTAAAGATTTGC TTT ATAC AAG ATTGTTGC AGT 



>gi|7019339|ref |NP_037486 . 1 | CCR4-N0T transcription complex, subunit 7 
isoform 1; BTG1 binding factor 1; carbon catabolite repressor protein 
(CCR4) -associative factor 1 (Homo sapiens] 

MKKIRQVIRKYNYVAMDTEFPGWARPIGEFRSNADYQYQLLRCNVDLLKIIQLGLTFMNEQGEYPPGTS 
TWQFNFKFNLTEDMYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMTSGVVIXrEGWWLSFHSGYDFG 
YLIKILTNSNLPEEELDFFEILRLFFPVIYDVKYLMKSCKNLKGGLQEVAEQLELERIGPQHQAGSDSLL 
TGMAFFKMREMFFEDHIDDAKYCGHLYGLGSGSSYVQNGTGNAYEEEANKQS 

>gi | 24496778 | ref | NP_004770 . 3 | CCR4 -NOT transcription complex, subunit 
8; PGK promoter directed over production; P0P2 (yeast homolog) [Homo 
sapiens] 

MPAALVENSQVICE^ASNLEEEMRKIREIVPSYSYIAMDTEFPGVVVRPIGEFRSSIDYQYQLLRCNVD 
LLKIIQLGLTFTNEKGEYPSGINTWQFNFKFNLTEDMYSQDSIDLIjANSGLQFQKHEEEGIDTLHFAEL»Ij 

MT SGWLC DNVKWL S FH S GYDFG YMVKLLTD S RL P E EEHE FLH I LNLF S P S I YBVKYLMKSCKNLKGG LQ 
EVADQLDLQRIGRQHQAGSDSLLTGMAFFRMKELFFEDSIDDAKYCGKLYGLGTGVAQKQNEDVDSAQEK 

MSILAIINNMQQ 

>gi | 4758030 | ref |NP_004362 . 1 | coatomer protein complex, subunit alpha; 
alpha coat protein; xenin [Homo sapiens] 

MLTKFETKSARVKGLSFHPKRPWILTSLHNGVIQLWDYRMCTLIDKFDEHDGPVRGIDFHKQQPLFVSGG 

DDYKIKVWNYlOjRRCLFTLLGHLDYIRTTFFHHEYPWILSASD^ 

AQFHPTEDLWSASLDQTVTIVWDISGLRKKNLSPGAV^^ 

AAFHPTMPLIVSGADDRQVKIWRMNESKAWEV1>TCRGHYNNVSCAVFHPRQELILSNSED 
RTGVQTFRRDHDRFWVTAAHPNLt^FAAGHIX^^IVFK^ 

DVAVT4QLRSGSKFPVFNMSYNPAENAVXIXTRAS^ENSTYDLYTIPKDADSQNPDAPEGKRSSGLTAW 
VARNRFAVXDRMHSLLIKNLKNEITKKVQVPNCDEIFYAGT^ 

VKWIWSADMSHVALLAKHAIVICNRKLDAIXINIHENIRWSGATOESGVFIYTTSNHIKYAVTTGDHGI 

IRTLDLPIWTRVKGNNWCLDRECRPRVLTIDPTEFKFKLALINRKYDEVLHMVR^ 

KKGYPEVALHFVKDEKTRFSI^ECGNIEIALEAAKAIJ)DKNCWEKIXSE^ALLQGNHQIVEMCY 

DKVSFLYLITGNLEKLRKMMKIAEIRKDMSGHYQNALYLGDVSERVRILKNCGQKSLAYLTAATHGLDEE 

AESLKETFDPEKETIPDIDPNAKLLQPPAPIMPLDTNWPLLWSKGFFEGTIASKGKGGAIAADIDIDTV 

GTEGWGEDAELQLDETCFVEATEGLGDDALGKGQEEGGGWDVEEDLELPPELDISPGAAGGAEDGFFVPP 

TKGTSPTQIWCNNSQLPTOHILAGSFETAMRLLHDQVGVIQFGPYKQLFLQTYARGRTTYQALPCLPSMY 

GYPtWNWKDAGLKNGVPAVGLKLNDLIQRLQl^YQLTTVGKFEEAV 

QQLITICREYIVGLSVETERKKI,PKETLE(X3KRICEMAAYFTHSNLQPVHMILVXRTALNLFFKLKNFKT 
AATFARRLLELGPKPEVAQQTRKILSACEKNPTDAYQLNYDMHNPFDICAASYRPIYRGKPVEKCPLSGA 
CYSPEFKGQICRVTTVTEIGKDVIGLRISPLQFR 

>gi | 21618359 | ref | NP_004095 . 3 | fatty acid synthase [Homo sapiens] 
MEEWIAGMSGKLPESENLQEFWDNLIGGVDMOTDDDRRWKAGLYGLPRRSGKLKDLSRFDASFFGVHPK 
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QAHTMDPQLRLLLEVTYEAIVDGGINPDSLRGTHTGVWVGVSGSETSEALSRDPETLVGYSMVGCQRAMM 

ANRLSFFFDFRGPSIAIiDTACSSSLMAI..QNAYQAIHSGQCPAAIVGGINVLLKPNTSVQFLRLGMLSPEG 

TCKAFDTAGNGYCRSEGWAVLLTKKSLARRVYATILNAGTNTDGFKEQGVTFPSGDIQEQLIRSLYQSA 

GVAPESFEYIEAHGTGTKVGDPQELNGITRALCATRQEPLLIGSTKSNMGHPEPASGLAALAKVI.LSLEH 

GLWAPNLHFHS PNPEI PALLDGRLQVVDQPLPVRGGNVGINS FGFGGSNVHI ILRPNTQPP PAPAPHATL 

PRLLRASGRT PEAVQKLLEQGLRH SQDLAFLSMLNDI AAVPATAM P FRGYAVLGGERGGPEVQQVPAGER 

PLWFICSGMGTQWRGMGLSLMRLDRFRDSILRSDEAVKPFGLKVSQLLLSTDESTFDDIVHSFVSLTAIQ 

IGLIDLLSCMGLRPDGIVGHSLGEVACGYADGCLSQEEAVLAAYWRGQCIKEAHLPPGAMAAVGLSWEEC 

KQRCPPAWPACHNSKim^ISGPQAPWEFVEQLRKEGVFAKE\niTGGMAFHSYFMEAIAPPLIiQELKK 

VIREPKPRSARWLSTSIPEAQWHSSLARTSSAEYNVNNLVSPVLFQEALVWVPEHAVVLEIAPHA^ 

LKRGLKPSCTIIPLMKKDIIRDNLEFFLAGIRRLHIiSGIDANPNALFPPVEFPAPRGTPLISPLIKWDHSL 

AWDVPAAEDFPNGSGSPSAAIYNIDTSSESPDHYLVDHTLDGR^FPATGYIiSIVWKTLARPUSLG 

PVVFEDVVLHQATILPKTGTVSLE^mLLEASRAFE^SENGI^WSGKWQWDDPDPRLFDHPESPTPNPT 

EPLFLAQAEVYKELRLRGYDYGPHFQGILEASLEGDSGRLLVnONWSFMDTMLQMSILGSAKHGLYLPT 

RVTAI HID P AT HRQKL YTLQD KAQVADVW S RWL RVT V AGGVH I S GLHTE S APRRQQ EQQV P I L E KFC FT 

PHTEEGCLSERAALQEELQLCKGLVQALQTKVTQQGLKMWPGLDGAQIPRDPSQQELPRLLSAACRLQL 

NGNLQLEUAQVLAQERPKLPEDPLLSGLLDSPALKACLDTAVENMPSLKMKVVEVLAGHGHLYSRIPGLL 

S PH PLLQLS YTATDRH PQALEAAQAELQQHDVAQGQWDPADP AP S ALGS ADLLVCNC AVAAIiGDP A SALS 

NMVAALREGGFLLLHTUjRGHPSGHVAFLTSTEPQYGQGILSQDAWESLFSRVSVRLVGIjKKSFYGSTLF 

UTRRPTPQDSPIFLPVDDTSFRWVESLKGILADEDSSRPVWLKAINCATSGVVGLVNCLRREPGGTLRCV 

LLSNLSSTSHVPEVDPGSAELQKVLQGDLVMNVYRDGAWGAFRHFL^ 

SIRWVCSSLRHAQPTCPGAQLCTVYYASLNFRDIMIATGKI,SPDAIPGKWTSQDSLLGMEFSGRDASGKR 

VMGLVP AKGLAT SVLLS PD FLWDV P SNWTL E EAASV PVVY STAYYALWRGRVR PG ETLLI H SG SGGVGQ 

AAIAIALSLGCRVFTTVGSAEKRAYLQARFPQLDSTSFANSRDTSFEQHVLWHTGGKGVDLVLNSLAEEK 

LQASVRCLATHGRFLEIGKFDLSQNHPLGMAIFLKNVTFHGVLLDAFFNESSADWREVWALVQAGIRDGV 

VRPLKCTVFHGAQVEDAFRYMAQGKHIGKVVVQVLAEEPEAVLKGAKPKLMSAI SKTFC PAHKSYI I AGG 

LGGFGLELAQWLIQRGVQKLVLTSRSGIRTGYQAKQVRRWRRQGVQVQVSTSNISSLEGARGLIAEAAQL 

GPVGGVFl^VVLRDGLLENQTPEFFQDVCKPKYSGTLNLDRVTREACPELDYFVVFSSVSCGRGNAGQS 

NYGFANSAMERICEKRRHEGLPGLAVQWGAIGDVGILVETMSTNDTIVSGTLPQRMASCLEVLDLFLNQP 

HMVLSSFVI^EKAAAYRDRDSQRDLVEAVAHILGIRDLAAV^ 

VLSVREVRQLTLRKLQELSSKADEASELACPTPKEDGLAQQQTQLN^ 

PLFLVHPIEGSTTVFHSLASRLSIPTYGLQCTRAAPLDSIHSLAAYYIDCIRQVQPEGPYRVAGYSYGAC 
VAFEMCSQLQAQQSPAPTHNSLFLFDGSPTYVLAYTQSYRAKLTPGCEAEAETEAICFFVQQFTDMEHNR 
VLEALLPLKGLEERVAAAVDLIIKSHQGIX>RQELSFAARSFYYKLRAAEQYTPKAKYHGNVMLLR^ 
AYGEDLGADYNLSQVCDGKVSVHVIEGDHRTLLEGSGLESIISIIHSSLAEPRVSVREG 

>gi |24234688 |ref |NP_004125. 3 | heat shock 70kDa protein 9B precursor? 
heat shock 70kD protein 9; stress-70 protein, mitochondrial; 75 kDa 
glucose regulated protein; peptide-binding protein 74; mortal in, 
perinuclear; p66-mortalin [Homo sapiens] 

MISASRAAAARLVGAAASRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAVVGIDLGTTNSCVAVM 
EGKQAKVLENAEGARTTPS WA FTADG ERLVGMPAKRQAVTNPNNTF YATKRLIGRRYDD PEVQKDI KNV 
PFKIVRASNGDAWVEAHGKLYSPSQIGAFVIiMKMKETAEOTLGHTAKNAVITVPAYFOT 
ISGLNVLRVINEPTAAAIAYGLDKSEDKVIAVYDI^GGTFDISILEIQKGVFEVKSTNGDTFLGGEDFDQ 
ALLRHIVKEFKRETGVDLTKDNMALQRVREAAEKAKCELSSSW 

FEGIVTDLIRRTIAPCQKAMQDAEVSKSDIGEVILVGGMTRMPKVQQTVQDLFGRAPSKAVNPDEAVAIG 
AAIQGGVLAGDWDVLLLDVTPLSLGIETLGGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVCQGE 
REMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDIDANGIVHVSAKDKGTGREQQIVIQSSGGLSKDDIEN 
MVKNAEKYAEEDRRKKERVEAVNMAEGIIHDTET 

ENIRQAASSLQQASLKLFEMAYKKMASEREGSGSSGTGEQKEDQKEEKQ 

>gi|29732424|ref |XP_293066.1| similar to Stress-70 protein, 
mitochondrial precursor (75 kDa glucose regulated protein) (GRP 75) 
(Peptide-binding protein 74) (PBP74) (Mortalin) (MOT) [Homo sapiens] 
MCDKAAIISASRAAAARLRGTAASPGPTAARQQDGWNGLSHEAFRIVSRQDYASEAINGAAGGVYLGTTN 
SCVAVMEGKQAKVLENAEGARTTPSVVAFTADVQKDIKNIPFKIVCASNGDAWVEAHGKLYSPSQIGAFV 
SMKMK ETA ENYI/GHT AKNAV I AVP A Y FNDLQRQ ATKDAGQ I SGLNVLR V INE PTAAAL A YGLDK S KDKV I 
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AVYDLGGGTFDISILEFQKGVFEVKSTNGDTFLGGEDFIXJALLQHIVKEFKRETGVDLTKDNVAFQRVWE 
AAEKAKCELSSSVQTDINLPYLTMDSSGPKHIOTKLTHAQFEGIVTDLIRRTIAPCQKAMQDAE^SKSDI 
GEVILVEPPSKAVNPDEAVAIGAAIQGDVLAGDVTDVLLLDVI PLSLGIETLRGVFTKLINRNTTI PTKK 
SQVFSTAADGQTQVEIKVCQGEREMAGDNKLLGQFTLIGIPPAPRGVPQIKVTFDIDANGIIHVSAKDKG 
TGHEQQIVIQSSGGLSKDDVENMVKNAEKYAKEDQRRKE 

>gi | 20521724 |dbj | BAA76851 .2 | KIAA1007 protein [Homo sapiens] 
LIMHAMAEWYMRGEQYDQAKLSRILDVAQDLKALSMLLNGTPFAFVIDLAA^ 

EHGEPFIQACMTFLKRRCPSILGGLAPEKDQPKSAQLPPETLATMLACLQACAGSVSQELSETILTMVAN 
CSNVMNKARQPPPGVMPKGRPPSASSLDAISPVQIDPI^GMTSLSIGGSAAPHTQSMQGFPPNLGSAFST 
PQS P AKAF P PL ST PNQTTAF SG IGGL S SQL PVGGLGTG S LTG IGTG ALGL P AVNNDP FVQRKLGTSGLNQ 
PTFQQTDLSQVWPEANQHFSKEIDDEANSYFQRIYNHPPHPTMSVDEVLEMLQRFKDSTIKREREVFNCM 
LRNLF E E Y RF F PQ Y PDKELH I T AC L FGG 1 1 EKGLVTYMALGLALRYVLEALRK PFGS KMYYFG I AALDRF 
KNRLKD YPQYCQHLAS I SHFMQF PHHLQEY I EYGQQS RDP PVKMQGS ITT PGS I ALAQAQAQAQV PAKAP 
LAGQVSTMVTTSTTTWAKTVTVTRPTGVSFKKDVPPSINTTO 
IFNX^SQSIWTQKVEELKETVKEEFMPWVSQYLVMKRVSI 

NIKVLLTSDKAAANFSDRSLLKNLGHWLGMITLAKNKPILHTDLDVKSLLLEAYVKGQQELLYWPFVAK 
VTjESSIRSVVFRPPNPOTMAIMNVLAELHQEHDLKL^ 

LDEQLSAPKKDVKQPEEX.PPOITTTTTSTTPATim'CTAWPPQPQYSYHDINVYSLAGI^ 

LFQAHPQLKQCVRQAIERAVQELVHPVVDRSIKIAMTTCEQIVRKDFALDSEESRMRIAAHHMMRNLTAG 

MAM I TC RE PLLMS I STNLKNS FASALRTAS PQQREMMDQAAAQLAQDNCELACCFI QKTAVEKAGPEMDK 

RIATEFELRKHARQEGRRYCDPWLTYQAERMPEQIRLKVGGVDPKQLAVYEEFARNVPGFLPTNDLSQP 

TGFLAQPMKQAWATDDVAQIYDKCITELEQHLHAIPPTLAMNPQAQALRSLLEWVLSRNSRDAIAALGL 

LQKAVEGLLDATSG ADADLLLRYRECHLLVLKALQDGRAYG S PWCNKQI TRCLI EC RDEYKYNVEAVELL 

IRNHLVNMQQYDLHLAQSMENGIJiYMAVAFAMQLVKILLVDERSVAHVTElADLFHTI ETLMR INAHSRGN 

AP EGL PQLMEWR SNYEAMI DRAHGG PNFMMH SG X S QAS EY DDP PGLREKAEYLLREWVNL YH S AAAGRD 

STKAF S AFVGQMHQQGI LKTDDLI TRF FRLCTEMCVEI S YRAQAEQQHNPAANPTMIRAKC YHNLDAFVR 

LIALLVKHSGEATNTVTKINLLNKVIX5IVVGVLLQDHDVRQSEFQQLPYHRIFIMLLLELNA 

NFQTLTAFCNTFHILRPTKAPGFVYAV^ELISHRIFIARMIAHTPQQKGWPMYAQLLIDLFKYLAPFLRN 

VELTKPMQILYKGTLRVLLVLLHDFPEFLCDYHYGFCDVIPPNCIQLRNLILSAFPRNMRLPDPFTPNLK 

VDMLSEINIAPRILTNFTGVMPPQFKKDLDSYLKTRSPVTFLSDLRSNLQVSNEPGNRYNLQLINALVLY 

VGTQAIAHIHNKGSTPSMSTITHSAHMDIFQNLAVDLDTEGRYLFLNAIANQLRYPNSHTHYFSCTMLYL 

FAEANTEAIQEQITRVLLERLIVNRPHPWGLLITFIELIKNPAFKFWNHEFVHCAPEIEKLFQSVAQCCM 

GQKQAQQVMEGTGAS 

>gi|10047283|dbj|BABl3430.l| KIAA1604 protein [Homo sapiens] 

TVKEFRALGTCYLDQFEASLIYWTNWQQKMKSSVAQIKPSSGHDRRENt^NSYQRNSSPEDRYEEQERSPR 

DRDYFDYSRSDYEHSRRGRSYDSSMESRNRDREKRRERERDTDRKRSRKSPSPGRRNPETSVTQSSSAQD 

EPATKKKKDELDPLLTRTGGAYIPPAKLRMMQEQITDKNSLAYQRMSWEALKKSINGLINKVNISNISII 

IQELLQENIVRGRGLLSRSVLQAQSASPIFTHVYAALVAIINSKFPQIGELILKRLILNFRKGYRRNDKQ 

LCLTASKFVAHLINQNVAHEVLCLEMLTLLLERPTDDSVEVAIGFLKECGLKLTQVSPRGINAIFERLRN 

ILHESEIDKRVQYMIEVMFAVRKDGFKDHPIILEGLDLVEEDDQFTHMLPLEDDYNPEDVLNVFKMDPNF 

MENEEKYKAIKKEILDEGDTDSOTDQDAGSSEEDEEEEEEEGEEDEEGQKVTIHDKTEINLVSFRRTIYL 

AIQSSLDFEECAHKLLKMEFPESQTKELCNMILDCCAQQRTYEKFFGLLAGRFCMLKKEYMESFEGIFKE 

QYDTIHRLETNKLRNVAKMFAHLLYTDSLPWSVLECIKLSEETTTSSSRIFVKIFFQELCEYMGLPKLNA 

RLKDETLQPFFEGLL PRDNPRNTRFAINFFTS IGLGGLTDELREHLKNTPKVI VAQKPDVEQNKS S P S S S 

SSASSSSESDSSDSDSDSSDSSSESSSEESDSSSISSHSSASANDVRKKGHGKTRSKEVDKLIRNQQTND 

RKQKERRQEHGHQETRTERERRSEKHRDQNSSGSNWRDPITKYTSDKDVPSERNNYSRVANDRDQEMHID 

LENKHGDPKKKRGERRNSFSENEKHTHRIKDSENFRRKDRSKSKEMNRKHSGSRSDEDRYQNGAERRWEK 

SSRYSEQSRESKKNQDRRREKSPAKQK 

>gi I 14042953 | ref | NP_114420 . 1 | FKSG17 [Homo sapiens] 

MPGEATETVPAIEQQLLQPQAETGSGTESDSDESVPELEEQDSTQVTAQVQLWAAEIDEEPVSKAKQRR 

sekkarkarfio^lqqvtgvtrvtirksknilfvitkpd^ 

kfkvqgeavsniqentqtptvqegsedeevdetgveikdielvlsqanvwgakavralknsndivnaime 

LTM 
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>gi|2224667|dbj|BAA2 0818.1| KIAA0363 [Homo sapiens] 

EPCALTPGPSHLALTFLPSKPGARPQPEGASWDAGPGGAPSAWADPGEGGPSPMLLPEGLSSQALSTEAP 
L P ATLE P R I VMGEETCQALL S PRAARTALRDQEGGHAS PD P P PELC S QGDLS VP S PPPDPDSFFTPP ST P 
TKTTYALLPACGPHGDARDSEAELRDELLDSPPASPSGSYITADGDSWASSPSCSLSLLAPAEGLDFPSG 
WGLSPQGSMVDERELHPAGTPEPPSSESSLSADSSSSWGQEGHFFDLDFLANDPMIPAALLPFQGSLIFQ 
VEAVEVTPLSPEEEEEEAVADPDPGGDLAGEGEEDSTSASFLOSLSDLSITEGMDEAFAFRDDTSAASSD 
SDSASYAEADDERLYSGEPHAQATLLQDSVQKTEEESGGGAKGLQAQDGTVSWAVEAAPQTSDRGAYLSQ 
RQELISEVTEEGLALGQESTATVTPHTLQVAPGLQVEVATRVTPQAGEEETDSTAGQESAAMAMPQPSQE 
GISEILGQESVTAEKLPTPQEETSLTLCPDSPQNLKEEGGLDLPSGRKPVAAATIVPRQAKEDLTLPQDS 

AMTP PL PLQDTDL S S A P K PVAAAT I VSQQAEEGLTL P QDS VMT P PL PLQDT EL S S APK PVAAATLV S QQA 
EEGLTLPQDSAMTPPLPLQDTDLSSAPKPVAAATLVSQQAEEGLTLPQDSAMTPPLPLQDTDLSSAPKPV 
AAATLVSQQAEEGLTLPQDSAMTPPLPLQDTDLSSAPKPVAAATIVSQQAEEGLTLPQDSAMTPPLPLQD 
TDLSSAPKPVAAATIVSQQAEEGLTLPQDSAMTPPLPLQDTDLSSAPKPVAAATPVSQQAEEGLTLPQDS 
AMTPPLPLQDTDLSSAPKPVAAATPVSQQAEEGLTLPQDSAMTAPLPLQDTGPTSGPEPLAVATPQTLQA 
EAGCAPGTEPVATMAQQEVGEALGPRPAPEEKNAALPTVPEPAALDQVQQDDPQPAAEAGTPWAAQEDAD 
STLGMEALSLPEPASGAGEEIAEALSRPGREACLEARAHTGDGAKPDSPQKETLEVENQQEGGLKLLAQE 
HGPRSALGGAREVPDAPPAAC PEVSQARLLSPAREERGLSGKSTPEPTLPSAVATEASLDSC PESSVGAV 
SSLDRGCPDAPAPTSAPTSQQPEPVLGLGSVEQPHEVPSVLGTPLLQPPENLAKGQPSTPVDRPLGPDPS 
APGTLAGAALPPLEPPAPCLCQDPQEDSVEDEEPPGSLGLPPPQAGVQPAAAAVSGTTQPLGTGPRVSLS 
PHSPLLSPKVASMDAKDLALQILPPCQVPPPSGPQSPAGPQGLSAPEQQEDEDSLEEDSPRALGSGQHSD 
SHGESSAELDEQDILAPQTOQCPAQAPAGGSEETIAKAKQSRSEKKARKAMSKLGLRQIQGVTRITIQKS 
KNILFVIAKPDVFKSPASDTYWFGEAKIEDLSQQVHKAAAEKFKVPSEPSALVPESAPRPRVRLECKEE 
EEEEEEEVDEAGLELRDIELVMAQANVSRAKAVRALRDNHSDIVNAIMELTM 

>gi|18557511|ref |XP_087502.1| similar to alpha NAC/1.9.2. protein [Homo 
sapiens] 

MPCEATETVPATEQELPQPQAETGSGTESDSDESVPEPEEQDSTQTSTQEAQLVAAAEIDEEPVSKAKQT 

WS EKKAQ KAM S KLGLRQVTGVTRVAI RKSKN I LFV I TK PNVC KS P ALDT Y I V SG EAKI EDL S EQ AQL AAA 
EKFTWQGEAVSNIQENTQTPTVQKESEEEETOETGVEVKDIELWSQANVW 

MELTM 

>gi | 5031931 | ref |NP_005585 . 1 | nascent-polypeptide-associated complex 
alpha polypeptide [Homo sapiens] 

MPGEATETVPATEQELPQPQAETGSGTESDSDESVPELEEQDSTQATTQQAQLAAAAEIDEEPVSKAKQS 
RSEKKARKAMSKLGLRQVTGVTRVTIRKSKNILFVITKPDVYKSPASDTYIVFGEAKIEDLSQQAQLAAA 

EKFKVQGEAVSNIQENTOTPWQEESEEEEVDETGVEVKDIELVMS 
MELTM 

>gi | 4758734 | ref |NP_004270.1| peptidase (mitochondrial processing) beta; 
mitochondrial processing peptidase-beta [Homo sapiens] 

MAAAAARVVLSSAARGGLWGFSESLLIRGAAGRSLYFGENRLRSTQAATQVVLNVPETRVTCLESGLRVA 
SEDSGLSTCTVGLWIDAGSRYE^EKNNGTAHFLEHMAFKGTKKRSQLDLELEIENMGAHLNAYTSREQTV 
YYAKAFSKDLPRAVEILADIIQNSTLGEAEIERERGVILREMQEVETNLQEVVFDYLHATAYQNTALGRT 
ILGPTENIKSISRKDLVDYITTHYKGPRIVLAAAGGVSHDELLDLAKFHFGDSLCTHKGEIPALPPCKFT 
GSEIRVRDDKMPLAHLAIAVEAVGWAHPDTICLMVANTLIGNWDRSF^ 

OS FNT S YTDTGLWGLYMVC ES STVADMLHWQKEWMRLCTSVTES EVARARNLLKTNMLLQLDGSTP IC E 
DIGRQMLCYNRRIPIPELEARIDAVNAETIREVCTKYIYNRSPAIAAVGPIKQLPDFKQIRSNMCWLRD 

>gi|4507841|ref |NP_003356.1| ubiquinol -cytochrome c reductase core 

protein I (Homo sapiens] 

MAAS WCRAATAGAQVLLRARRSPALLRTPALRSTATFAQALQFVPETQVSLLDNGLRVAS EQS S QPTCT 

VGVWIDVGSRFETEKNNGAGYFLEHLAFKGTKNRPGSALEKEVESMGAHLNAYSTREHTAYYIKALSKDL 

PKAVELLGDIVQNCSLEDSQIEKE1RDVILREMQENDASMRDVWNYLHATAFQGTPLAQAVEGPSENVR 

LSRADLTEYLSTHYKAPRMVLAAAGGVEHQQLLDLAQKHLGG I PWT Y AEDAVPTLT PC RFTGS E I RHRDD 

ALPFAHVAIAVEGPGWASPDSVALQVANAIIGHYDCTYGGGVHLSSPLASGAVANKLCQSFQTFSICYAE 

TGLLGAHFVCDRMKIDDMMFVLQGQWMRLCTSATESEVARGKNILRNALVSHLDGTTPVCEDIGRSLLTY 

GRRIPLAEWESRIAEVDASWREICSKYIYDQCPAVAGYGPIEQLPDYNRIRSGMFWLRF 
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>gi | 26051278 | ref |NP_742017 . 1 | nuclear pore membrane protein 121; 
nuclear pore membrane protein 121 kDa; nuclear envelope pore membrane 
protein POM 121; P145 [Homo sapiens] 

MVCSPVTWIAPPDRRFSRSAIPEQIISSTLSSPSSNAPDPCAKETVLSALKEKEKKRTVEEEDQIFLDG 
QENKRRRHDSSGSGHSAFEPLVANGVPASFVPKPGSLKRGLNSQSSDDHLNKRSRSSSMSSLTGAYASGI 
PSSSRNAITSSYSSTRGISQLWKRNGPSSSPFSSPASSRSQTPERPAKKIREEELCHHSSSSTPIiAADRE 
SQGEKAADTTPRKKQNSNSQSTPGSSGQRKRKVQLLPSRRGEQLTLPPPPQLGYSITAEDLDLEKKASLQ 
WFNQAI/EDKSDAASNSV^ETPPITQPSFTFTLPAAAPASPPTSLLAPSTOTIiESLKKMOTPPSLPPCPE 
SAGAATTEALSPPKTPSLLPPLGLSQSGPPGLLPSPSFDSKPPTTLLGLIPAPSMVPATDTKAPPTLQAE 
TATKPQATSAPSPAPKQSFLFGTQNTSPSSPAAPAASSAPPMFKPIPTAPPKSEKEGPTPPGPSVTATAP 
SSSSIiPTTTSTTAPTFQPWSSMGPPASVPLPAPFFKQTTTPATAPTTTAPLFTGLASATSAVAPITSAS 
PSTDSASKPAFGFGINSVSSSSVSTTTSTATAASQPFLFGAPQASAASFTPAMGSIFQFGKPPALPTTTT 
VTTFSQSLHTAVPTATSSSAADFSGFGSTLATSAPATSSQPTLTFSNTSTPTFNIPFGSSAKSPLPSYPG 
ANPQPAFGAAEGQPPGAAKPALAPSFGSSFTFGNSAAPAAAPTPAPPSMIKWPAYVPTPIHPIFGGATH 
SAFGLKATASAFGAPASSQPAFGGSTAVFFGAATSSGFGATTQTASSGSSSSVFGSTTPSPFTFGGSAAP 
AGSGSFGIWATPGSSTTTGAFSFGAGQSGSTATSTPFAGGLGQNALGTTGQSTPFAFNVSSTTESKPVF 
GGTATPTFGLNTPAPGVGTSGSSLSFGASSAPAQGFVGVAPFGSAALSFSIGAGSKTPGARQRLQARRQH 
TRKK 

>gi (4506181 | ref |NP_002778 . 1 | proteasome alpha 2 subunit; proteasome 

subunit HC3; proteasome component C3; macropain subunit C3; 

multicatalytic endopeptidase complex subunit C3 [Homo sapiens] 

MAERGYSFSLTTFSPSGKIiVQlEYALAAVAGGAPSVGIKAANGVVTiATEKKQKSILYDERSVHKVEPITK 

HIGLWSGMGPDYRVXVliRARKLAQQYYLVYQEPIPTAQLVQRVASVMQEYTQSGGVRPFGVSLLICGWN 

EGRPYLFQSDPSGAYFAWKATAMGKNYVNGKTFLEKRYNEDLELEDAIHTAILTLKESFEGQMTEDNIEV 

GICNEAGFRRLTPTEVKDYLAAIA 

>gi | 4506183 | ref |NP_002779 . 1 1 proteasome alpha 3 subunit isoform 1; 
proteasome subunit C8; macropain subunit C8; multicatalytic 
endopeptidase complex subunit C8 [Homo sapiens] 

MSSIGTGTOLSASTFSPDGRVTQVEYAMKAVENSSTAIGIRCKIX^A^GVEKLVXSKLYEEGSI^RLFW 
DRHVGMAVAGLLADARSLADIAREEASNFRSNFGYNI PLKHLADRVAMYVHAYTLYSAVRPFGCSFMLGS 
YSVNEK3AQLYMIDPSGVSYGYWGCAIGKARQAAKTEIEKLQMK^ 
ELELSWVGELTNGRHEIVPKDIREEAEKYAKESLKEEDESDDDNM 

>gi (4506193 | ref | NP_002784 . 1 | proteasome beta 1 subunit; proteasome 
subunit HC5; proteasome component C5; macropain subunit C5; proteasome 
gamma chain; multicatalytic endopeptidase complex subunit C5 [Homo 
sapiens] 

MLSSTAMYSAPGRDLGMEPHRAAGPLOLRFSPYVFNGGTIIiAIAGEDFAIVASDTRLSEGFSIHTRDSPK 
CYKLTDKTVIGCSGFHGDCLTIiTKIIEARLKI^KHSNNKAMTTGAIAAMLSTILYSRR 
LDEEGKGAVYSFDPVGSYQRDSFKAGGSASAMLQPLLDNQVGFKNMQNVEHVPLSLDRAMRLVKDVFISA 
AE RDVYTGD AL R I C I VT KEG I REETV SLRKD 

>gi | 20539647 | ref | XP_167259 . 1 1 similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Rattus norvegicus] [Homo sapiens] 
MSWn.EEIIDDNHAIMSTSSlLWKDLLEPGCSV^^ 

DTGGLD S Q IQEIKEII LNI MKRWK YLHDG PKLI WELFLV AE EHA P S I MF I DEI DA I RTKRC DSNS D S ERE 
IQQIMLEMLNQLDGFDSRGDVKVIISTSRIETLDLALIRPGYTDRKLKFPLPDEKTKKHIFQMHTSSMTL 
ANDAICTEASLMALKEHGMJCVTNENFKKSQENVLYKEQEDTPKGLCL 

>gi | 27479738 | ref | XP_210528 . 1 | similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Rattus norvegicus] [Homo sapiens] 
MGESQSGGYGPGVCKKNGRTRKINMNLLYQLEWRRRKQRDQM^ 

K\^KTPQETYADTGGKEIKESV1<PPLTQHEYYEETGVKPPTGWGVILYGPPGTVAEEHAPSTVFIDDINV 
TGTKR YDSNS GG E I AI QRTMLELWNQLDG FDS RGDVKMHT SMMTLADDVNLDDL IMT KDDLFGAD I KA I C 



80 



EX03-029P 



TE AC LMALRE PQNE ENVLCKKQEGT P KG L YL 

>gi | 29727184 | ref |XP_292848 . 1 | similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Rattus norvegicus] [Homo sapiens) 
MGQSQSCX3HGPGSGKKDDKDKKKKYEPSVPTRVRKKK 

YTjLMEEEFIRNQEQMKPLEEKQKEERSKVDDLRGTPIQGSSGNWLLGPAQPQGACRDGVLMDDTDPLVTV 

MKMEKTPQETYANIGGLDNQIQEIKEAVEFPLTHPEYYEEMGKKPPKGVILYGPPGTGKTLLSKAVANQT 

SATFLRWGSELIQKYLVAEEHELSTMFTDETGAIGTKRYDSNSGGRIDRKIKFSLPDERTKKRIFQNHT 

SRMTLADEWLDDLIMXnCDDLSGADIKAICTEAGLMASRERRMKVTNEFFKKYKEMF 

SSEPQLPSGKWLGDSSTPEKDEQLVPKAGEDTPEGYLGQLMTQLFEQKEAKKFSRKSPELLEELASSUV 

>gi|29736610 |ref |XP_293923 .1 | similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Homo sapiens) 
MIVEVEELPAVAAQVAKTRVAEKEECTKGQDVASKLPLVT^ 

QI KLLEEKQE EGRS KVDDLRGTPMSVGNLEE I 1 DDNHAI V S T SVG S EH YD S IIS FVEKDLL E PGC S ILL R 
HKVHAVIGVLMDDTGPLVTMMKVEKAPQETYVNTGGLDNQI QEI KESMELPLPHPEYYEEMGTKPPKGVI 
LCGPPGTATFLRWGSELIQKYLGDGPKLVRQVFQVAEEHAPSIMFTDEIEAIGTKRYDSNSGGEREIQQ 
TMLELELI^QLGGFDSREDVKVIMATKQVETLDPVLIRPGRIDKKIEFHLPDEKTKKHIFQIHTSRMTLA 
NDVTLDDLIMAKDDFSGADIKAICTEAGLMALREHRMKATNEDFKKSIESVLYKKHEGIPEGLYL 

>gi | 29740164 | ref |XP_294034 . 1 | similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Rattus norvegicus) [Homo sapiens] 
MSVGTLEEINDSHAIVSTSVGSGHYANILSFVDKDLLATGCSVLL^ 

KVPQETYA^^^GRLDNQIQEIKESVELPLTHAEHYEEMGIKPPKGVILYGPPGIVKTLLAKAVANQTSATF 
LKVVGSELIQKYLGDGPKLTOELFQVAGEHAPSIWTDETDAIGTKRYDSNSGEAGLMALRERRMKATNE 
DFKKSKENVLYKKQEGTPEGLSL 

>gi | 29737532 | ref |XP_294368 . 1 | similar to protease (prosome, macropain) 
26S subunit, ATPase 1 [Homo sapiens) 

MVQSQSGGHGPGGGKKDDKDKKKKYKPPVPTRVGKKKKKTEGSYAASKLPLVTPHTQCQLKLLKKQKQME 
PLEEKQEEERSKVTDDLRGTPMSVGTLEEIINDSHAWSTSVGSERYVSILSFVDKDLLEPGCLVLLNHKV 
HAMIGVLMDNl^PLvTvTIKGLDNQSQEIKESVELPLTHPEYYEEMGIKPPKGVIHYGPPGTEYALSIMFI 
DEIDAIGTKRYDSNSGSEREIQRSALELLNQSDGFDSRGDVKVIMATNQIETLDPALIRPGCIDRKIEFP 
LHDENTKKRIFQIHTIRMTLANDVWjDDLIVTRDDLSGADIKAICTEVGLMASRECRMQVKNEDFQKSKE 
NVLYKKQEGTPERLYL 

>gi | 24430151 | ref |NP_002793 . 2 | proteasome 26S ATPase subunit 1; 
proteasome 26S subunit, ATPase, 1; 26S protease regulatory subunit 4 
[Homo sapiens) 

MGQSQSGGHGPGGGKKDDKDKKKKYEPPVPTRVGKKKKKTKGPDAASKLPLVTPHTQCRLKLLKLERIKD 
YLLMEEEFIRNQEQMKPLEEKQEEERSKVDDLRGTPMSVGTLEEIIDDNHAIVSTSVGSEHYVSILSFVD 
KDLLEPGC SVLLNHKVTIAVIGVTjMDDTDPLVTVMKVEKAPQETYADIGGLDNOIQEIKESVELPLTH PEY 
YEEMGIKPPKGVILYGPPGTGKTLLAKAVANQTSATFLRWGSELIQKYLGDGPKLVRELFRVAEEHAPS 

ivfideidaigtkrydsnsggereiqrtmlellnqldgfdsrgdvkvimatnrietldpalirpgridrk 

iefplpdektkkrifqihtsrmtiaddvtlddlimaxddlsgad 

kskenvlykkqegtpeglyl 

>gi (25777602 |ref | NP_002799 . 3 | proteasome 26S non-ATPase subunit 2; 
tumor necrosis factor receptor-associated protein 2; TNFR- associated 
protein 2; 55.11 protein; 26S proteasome non-ATPase regulatory subunit 
2; 26S proteasome regulatory subunit S2 ; 26S proteasome subunit p97 
[Homo sapiens] 

MEEGGRDKAPVQPQQSPAAAPGGTDEKPSGKERRDAGDKDKEQELSEEDKQLODELEMLVERLGEKDTSL 
YRPALEELRRQIRSSTTSOTSVPKPLKFLRPHYGKLKEIYENMAPGENKRFAADIISVIiAMTMSGERECL 
KYRLVGSQEELASWGHEYWHLAGEVAKEWQELDDAEKVQREPLLTLVXEIVPY10MAHNAEHEACDLLME 
IEQVI)MLEKDIDENAYAKVCLYLTSCVNYVPEPENSALLRCALGVFRKFS 

IFTSCKDVWQKQMAFMLGRHGVFLELSEDVEEYEDLTEIMSNVQLNSNFLAIARELDIMEPKVPDDIYK 



81 



EX03-029P 



THLENNRFGGSGSQVDS ARMNLAS S FVNGFVNAAFGQDKXjLTDDGNKWLYKNKDHGMLSAAASIiGMI LLW 
DVDGGLTQIDKYLYSSEDYIKSGALLACGIVNSGVRireCDPAIiALLSDYV^ 

GSNREDVLTLLLPVMGDSKSSMEVAGVTALACGMIAVGSCNGDVTSTILQTIMEKSETELKDTYARWLPL 
GLGIiNHLGKGEAIEAILAALEWSEPFRSFANTLVDVCAYAGSGNVLKVQQLLHICSEHFDSKEKEEDKD 
KKEKKDKDKKEAPADMGAHQCVAVLGIALIAMGEEIGAEMALRTFGHLLRYGEPTLR 
NPRLNILDTLSKFSHDADPEVSYNSIFAMGMVGSGTNNARLAAMLRQLA^^ 

LGKGTLTIXTPYHSDRQLMSQVAVAGLLTVLVSFLDVRNIILGKSHYVLYGLVAAMQPRMLVTFDEELRPL 
PVSVRVGQAVDWGQAGKPKTITGFQTHTTPVLLAHGERAELATEEFLPVTPIIiEGFVILRKNPNYPL 

>gi | 6382079 | ref |NP_006258.2 | RAN binding protein 2; nucleoporin 358; 
nuclear pore complex protein Nup358; P270 [Homo sapiens] 
MRRSKADVERYIASVQGSTPSPRQKSMKGFYFAKLYYEAKEYDLAKKYICTYINVQERDPKAHRFLGLLY 
EL E ENTDKAVEC YRRS VELN PTQKDLVLKI AELLC KNDVTDG RAKYWLERAAKLF PGS P AI YKLKEQLLD 
CEGEIX^KLFDLIQSELYVRPDDVHVNIRLVEVYRSTKRLKDAVAHCHEAERNIALRSSLEWNSCVVQT 
LKEYLESLQCLESDKSDWRATmTILLlAYAOT^LTLST^ 

TFLEMKGHFYMHAGSLLLKMGQHSSWQWRALSELAAIXTYLIAFQVPRPKIKLIKGEAGQNLLEMMACDR 
LS QSGHMLLNL S RGKQDF L KE I VETF AN KSGQ S AL YD AL F S S Q S PKDTS FLG SDD I GN I DVRE P ELEDLT 
RYDVGAIRAHNGSLQHLTV^LQVWSLPALPGIRKWLKQLFHHLPHETSRLETNAPESICILDLEVFLLG 
VVYTSHUJLKEKCNSHHSSYQPLCLPLPVCKQLCTERQKSWWDAVCTLIHRKAVPGNVAKLRLLVQHEIKT 
TLRAQEKHGLQ P ALLVHWAEC LQKTG SG DNS F YDQRE Y I GRS VH YWKKVL PhhK 1 1 KKKNS IPEPIDPLF 
KHFHSVDIQASEIVEYEEDAHITFAILDAVNGNIEDAVTAFESIKSWSYWNLALIFHRKAEDIENDALS 
PEEQEECKNYLRKTRDYLI KI IDDSDSNLSWKKLPVPLESVKEMLNSVMQELEDYSEGGPLYKNGSLRN 
ADSEIKHSTPSPTKYSLSPSKSYKYSPKTPPRWAEDQNSLLKMICQQVEAIKKEMQELKLNSSNSASPHR 
WPTENYGPDSVPDGYQGSQTFHGAPLTVATTGPSVYYSQSPAYNSQYLLRPAANVTPTKGPVYGMNRLPP 
QQHIYAYPQQMHTPPVQSSSACMFSQEMYGPPALRFESPATGILSPRGDDYFNYNVQQTSTNPPLPEPGY 
FTKPPIAAHASRSAESKTIEFGKTNFVQPMPGEGLRPSLPTQAHTTQPTPFKFNSNFKSNDGDFTFSSPQ 
VVTQPPPAAYSNSESLLGLLTSDKPLQGDGYSGAKPIPGGQTIGPRNTFNFGSKNVSGISFTENMGSSQQ 
KNSGFRRSDDMFTFHGPGKSVFGTPTLETANKNHETDGGSAHGDDDDDGPHFEPWPLPDKIEVKTGEED 
EEBFFCIsmAKLFRFDVESKEWKERGIGNVKILRHKTSGKIRLL^REQVLKICANHYISPDMKLTPNAGS 
DRSFVWHALDYADELPKPEQLAI RFKTPEEAALFKC KFEEAQS I LKAPGTNVAMASNQAVRI VKEPTSHD 
NKDICKSDAGNLNFEFQVAKKEGSV^CNSCSLKNASTAKKCVSCQm^PSNKELVGPPUVETVFTPKTS 
P ENVQDRF ALVT PKK EG HWDC S I CLVRNE PTVSRC I AC QNT KS ANKSGS S FVHQ AS FKFGQGDL PK P I NS 
DFRSVFSTKEGQWDCSACLVQNEGSSTKCAACQNPRKQSLPATSIPTPASFKFGTSETSKTLKSGFEDMF 
AKKEGQWDCSSCLVRNEANATRCVACQNPDKPSPSTSVPAPASFKFGTSETSKAPKSGFEGMFTKKEGQW 
DCSVCLVRNEASATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEGQWDCSVCLVRNEA 
SATKCIACQNPGKQNQTTSAVSTPASSETSKAPKSGFEGMFTKKEGQWDCSVCLVRNEASATKCIACQCP 
SKQNQTTAISTPASSEISKAPKSGFEGMFIRKGOWDCSVCCVQNESSSLKCVACDASKPTHKPIAEAPSA 
FTLGSEMKLHDSSGSQVGTGFKSNFSEKASKFGNTEQGFKFGHVDQENSPSFMFQGSSNTEFKSTKEGFS 
I PVSADGFKFGI SEPGNQEKKSEKPLENGTGFQAQDISGQKNGRGVI FGQTSSTFTFADIAKSTSGEGFQ 
FGKKDPNFKGFSGAGEKLFSSQYGKMANKANTSGDFEKDDDAYKTEDSDDIHFEPWQMPEKVELVTGEE 
DEKVLYSQRVKLFRFDAEVSQWKERGLG^KILKNEVN^ 

SDRAWMWL ASDF SDGD AKL EQL AAK F KTPEL AEEFKQKFE E C QRLLLD I PLQT PHKLVDTGRAAKL I QRA 
EEMKSGLKDFKTFLTNDQTKVTEEENKGSGTGAAGASDTTIKPNPEOT 

SSVHASPLASSPVRKI^FRFGESTTGFNFSFKSALSPSKSPAKLNQSGTSVGTDEESDVTQEEERDGQYF 
EPWPLPDLVEVSSGEENEQVVFSHRAKLYRYDKDVGQWKERGIGDIKILQNYDNKQVRIVMRRDQVLKL 
CANHRITPDMTLQNMKGTERVWliWTACDFADGERKVEHLAVRFKLQDVADSFKKIFDEAKTAQEKDSLIT 
PHVSRSSTPRESPCGKIAVAVLEETTRERTDVIQGDDVADATSEVEVSSTSETTPKAWSPPKFVFGSES 
VKSIFSSEKSKPFAFGNSSATGSLFGFSFNAPLKSNNSETSSVAQSGSESKVEPKKCELSKNSDIEQSSD 
SKVKNLFASFPTEESSIhTYTFKTPEKAKEKKKPEDSPSDDDVLIVYELTPTAEQKALATKLKIjPPTFFCY 
KNRPD YVSEE EEDDEDFETAVKKLNGKL YLDGS EKCRPLEENTADNEKEC I IVWEKKPTVE EKAKADTLK 
LPPTFFCGVCSDTDEDNGNGEDFQSELQKVQEAQKSQTEEITSTTDSVYTGGTEVMVPSFCKSEEPDSIT 
KSISSPSVSS ETMDK PVDL STRKE I DTD S T SQG E S K I VS FGFG S STGL S F ADLAS SNS GDF AFG S KDKN F 
QWANTCAAVFGTQSVGTQSAGKVGEDEDGSDEEVVrHNEDIHFEPIVSLPEVEVKSGEEDEEILFKERAKL 
YRWDRDVSQWKERGVGDIKILWHTMKimRIX^RRDQW 

ADGEAKVEQLAVRFKTKEVADCFKKTFEECQQNLMKLQKGHVSLAAELSKETNPWFFDVCADGE 
TMELFSNIVPRTAENFRALCTGEKGFGFKNSIFHRVIPDFVCQGGDITKHDGTGGQSIYGDKFEDENFDV 
KHTG PGLIiSMANQGQNTNN S QFV I TLKKA EHLDFKHWFGFVKDGMDTVKK I E S FG S PKG S VC RR I TI TE 
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CGQI 
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